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MODULATORS OF TISSUE REGENERATION 

Related Applications 

This is a continuation-in-part of prior U.S. Provisionals S.N. 60/047,490 and S.N. 
60/047,491, both filed May 23, 1997. The teachings of both earlier-filed Provisional patent 
applications are incorporated herein by reference. 

Field of the Invention 

The invention relates generally to gene expression products, including nucleic acids 
(e.g., RNAs) and polypeptides that are upregulated in injured or regenerating tissues. 
Further, the invention relates generally to cPNAs and other nucleic acids encoding 
polypeptides that are upregulated in injured or regenerating tissues. 

Background of the Invention 

The processes of tissue development during embryogenesis, and of tissue repair 
and/or regeneration following injury or insult are presently the topic of intense 
investigation. Both processes involve a dynamic remodeling of tissue architecture, which 
is triggered and mediated by numerous biological interactions, including cell-cell contact, 
cell-matrix contact, release of soluble biological response modifiers, synthesis of structural 
components, and many other changes in cell phenotype, including changes in gene 
expression. Many of the factors involved in tissue genesis and in the response to tissue 
insult remain unknown or poorly understood. 

Several systems have been developed for modeling the events triggered by insult to 
particular tissues, and/or by particular types of insult. For example, several investigators 
have described events occurring in mammalian kidney tissue exposed to an ischemia- 
reperfusion insult. The kidney is able to repair damage to the proximal tubule epithelium 
through a complex series of events involving cell death, proliferation of surviving proximal 
tubule epithelial cells, formation of poorly differentiated regenerative epithelium over the 
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denuded basement membrane, and differentiation of the regenerative epithelium to form 
fully functional proximal tubule epithelial cells (Wallin et al., Lab. Invest. 66:474-484, 
1992; Witzgall et aL, MoL Cell. Biol. 13:1933-1942, 1994; Ichimura et aL, Am. J. Physiol. 
269:F653-662, 1995; Thadhani et al., N. Engl. J. Med. 334:1448-1460, 1996). Growth 
5 factors such as IGF, EGF, and HGF have been implicated in this process of repair, as has 
the endothelial cell adhesion molecule ICAM-1 . However, the mechanisms by which the 
tubular epithelial cells are restored, either functionally and morphologically, remain poorly 
understood. 

There accordingly is a need for an improved understanding of the biological 
10 processes of tissue repair or regeneration. That is, there is a need to identify factors that are 
indicative of the initiation, development and resolution of normal wound healing or normal 
biological response to tissue insult. Similarly, there is a need to identify factors that are 
indicative of pathologies of abnormal responses to tissue insult or other stimulus, including 
autoimmune or other dysregulation pathologies, as well as pathologies associated with 
15 abnormal tissue growth including neoplastic growth. Further, there is a need to identify 
factors that constitute appropriate targets for therapeutic intervention to stimulate, 
modulate, enhance, suppress or otherwise manipulate biological responses to tissue insult. 
Similarly, there is a need to identify factors that constitute appropriate targets for 
therapeutic intervention to manipulate abnormal or dysregulated responses to tissue insult 
20 or other stimulus, including abnormal tissue growth associated with cancer (neoplasia) and, 
conversely, abnormal tissue quiescence associated with degenerative diseases. 

Summary of the Invention 

It is an object of this invention to identify factors that are indicative of the initiation, 
development and resolution of normal wound healing or normal biological response to 
25 tissue insult. A specific object of this invention is to identify products (e.g., RNAs, 

cDNAs, polypeptides) of genes that are upregulated in the context of healing tissue, or in 
tissue exposed to insult or injury. A particular object is to identify products of genes that 
are upregulated in mammalian kidney tissue exposed to insult or injury, such as ischemia- 
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reperfusion injury. A second specific object of this invention is to identify products of 
genes that arc upregulated in pathological conditions associated with abnormal responses 
to tissue insult or other stimulus, including autoimmune or other dysregulation pathologies, 
as well as pathologies associated with abnormal tissue growth including neoplastic growth. 
5 A third specific object is to identify gene products that constitute appropriate targets for 
therapeutic intervention to stimulate, modulate, enhance, suppress or otherwise manipulate 
biological responses to tissue insult. A particular object is to identify gene products for 

therapeutic intervention to manipulate responses of mammalian kidney tissue to tissue 

i 1 

injury, such as ischemia-reperfusion injury. A fourth specific object is to identify gene 
10 products that constitute appropriate targets for therapeutic intervention to manipulate 
abnormal or dysregulated responses to tissue insult or other stimulus, including abnormal 
tissue growth associated with cancer (neoplasia) and, conversely, abnormal tissue 
quiescence associated with degenerative diseases. Thus, other particular objects of the 
invention include the identification of gene products as targets for therapeutic intervention 
15 in the clinical management (including prophylaxis, maintenance and treatment) of kidney 
diseases, including diseases involving renal failure, and of cancers derived from or 
affecting renal tissue. 

The present invention rests on the discovery that the expression levels of numerous 
genes, including many genes heretofore unknown and/or uncharacterized, are upregulated 

20 in mammalian kidney tissue that has been exposed to tissue insult or injury. More 

specifically, the present invention rests on the discovery of a novel class of Kidney Injury- 
associated Molecules (each member of the class is henceforth called a "KIM"). KIMs are 
polypeptides encoded by genes whose expression levels are upregulated in the mammalian 
kidney upon exposure to tissue injury, particularly after exposure to ischemia-reperfusion 

25 injury. Any KIM is useful as an indicator of tissue status (preferably renal tissue status) or 
of a change therein (e.g, exposure to tissue injury, or the occurrence or stage of a tissue 
repair process). Any KIM further is useful as a target for therapeutic intervention to 
manipulate, whether by an agonist or an antagonist, a normal or abnormal tissue response 
to insult or other stimulus. 
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The invention accordingly provides, in a first aspect, purified and isolated nucleic 

acid molecules encoding all or a unique fragment of a KIM. In one embodiment, the 

i 

i 

present nucleic acids are RNAs. In another, they are DNA molecules, such as cDNAs. In 
another embodiment, the invention provides the complementary strands of nucleic acids 
1 1 5 encoding all or a unique fragment of a KIM. In another embodiment, the invention 
provides nucleic acids (preferably DNAs) that hybridize under low or, preferably, high 

stringency conditions to any of the foregoing nucleic acids. In still another embodiment, 

i 

the invention provides nucleic acids (preferably DNAs) which, but for the degeneracy of 
the genetic code, would hybridize to any of the foregoing nucleic acids. In some 

10 embodiments, a nucleic acid encoding all or a unique fragment of a KIM is an engineered 

i 

(recombinant) nucleic acid, optionally in operative association with an expression control 

element or other regulatory element. In other embodiments, a nucleic acid encoding all or 

1 t 

a unique fragment of a KIM is an antisense nucleic acid sufficient, when internalized 
within a cell, to disrupt expression of a cellular KIM gene. 

15 Specific novel KIMs (and unique fragments thereof) of the present invention are 

coded for by nucleic acids having the sequences disclosed herein in TABLE 1 and in the 
Sequence Listing. Other specific KIMs (and unique fragments thereof) are coded for by 
nucleic acids having sequences that are degenerate variants of any of the KIM sequences 
set forth in TABLE 1 and in the Sequence Listing. Still other specific KIMs (and 

20 fragments) are coded for by nucleic acids that are substantially similar to (homologous to) 
any of the KIM sequences in TABLE 1 and in the Sequence Listing. Such KIMs are 
defined herein as variants of the disclosed novel KIM sequences. In some embodiments, 
the present nucleic acid encodes a chimeric polypeptide comprising a novel KIM-encoding 
sequence (i.e., a sequence encoding part or all of a KIM) fused to a non-KIM sequence. 

25 Thus, the invention provides nucleic acids encoding novel KIM fusion proteins, non- 
limiting examples of which include KIM polypeptides fused to a secietable leader 
polypeptide, an immunoglobulin polypeptide, a binding pair partner (e.g., avidin, GST), a 
toxin or toxoid (e.g., ricin, tetanus), an enzyme (preferably one for which a detectable 
substrate is available, e.g., alkaline phosphatase, horseradish peroxidase, luciferase), or 
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other detectable polypeptide (e.g., green fluorescent protein). Additional specific KIMs 
(and unique fragments thereof) are coded for by nucleic acids referred to in TABLE 2. It is 
believed that, in most to all instances, these specific molecules have not heretofore been 

i 

appreciated to be KIMs. 

5 In a second aspect, the invention provides a vector having a KIM-encoding nucleic 

acid inserted therein. In some embodiments, the vector is a biologically functional plasmid 
or viral DNA vector. In other -embodiments, the vector is a retroviral vector. 

In a third aspect, the invention provides a prokaryotic or eukaryotic host cell 
comprising an internalized vector having a KIM-encoding nucleic acid insert. The present 
10 host cell provides intracellular means for producing (synthesizing, folding, processing, or 
secreting) a KIM polypeptide of the present invention. 

In a fourth aspect, the invention provides a process for the production of a KIM 
polypeptide. The present production process includes growing a host cell of the invention 
under culture conditions sufficient for the production of polypeptides of vector origin, and 
15 recovering an expressed KIM polypeptide. 

In a fifth aspect, the invention provides a purified and isolated novel KIM 
polypeptide, preferably substantially free of non-KIM polypeptides or proteins. In some 
embodiments, the present polypeptide is a full-length polypeptide, i.e., a polypeptide 
corresponding to the full-length open reading frame of a novel KIM-encoding cDN A. In 

20 other embodiments, the present polypeptide is a unique fragment of the full-length 

polypeptide. Thus, specific novel KIM polypeptides (and unique fragments thereof) of the 
present invention are coded for by nucleic acids having the sequences disclosed herein in 
TABLE 1 and in the Sequence Listing. Other specific novel KIM polypeptides of the 
present invention are variants of the disclosed novel KIM sequences, including without 

25 limitation splice variants, truncation variants, and substitution variants. In some 

embodiments, the invention provides a chimeric polypeptide comprising a novel KIM 
polypeptide fused (preferably via a peptide bond) to a non-KIM polypeptide. Thus, the 
invention provides novel KIM fusion proteins, exemplified by the above-mentioned fusion 

-5- 



WO 98/53071 



PCT/US98/10547 



constructs. Of course, the invention also provides conjugated or derivatized novel KIM 
polypeptides, including without limitation detectable conjugates, imageable conjugates, 
radiolabeled conjugates, and toxin conjugates* In still other embodiments, the invention 
provides chimeric polypeptides, fusion proteins and conjugates comprising a polypeptide 
5 appreciated herein as being a KIM (see TABLE 2). 

In a sixth aspect, the invention provides an antibody that binds selectively to a KIM 
polypeptide. Preferably, the antibody is a monoclonal antibody (or an engineered 
derivative thereof) produced by conventional means from a hybridoma derived from 
splenocytes of an animal immunized with a KIM polypeptide of the present invention. In 
10 some embodiments, the present antibody is conjugated or derivatized with a detectable 
moiety, toxin, imageable compound or radionuclide. 

In a seventh aspect, the invention provides a pharmaceutical composition 
comprising a therapeutically effective amount of a KIM nucleic acid (e.g., an antisense 
nucleic acid); a KIM vector; a KIM polypeptide; a KIM fusion protein; a KIM-binding 
15 antibody (also referred to as an anti-KIM antibody), dispersed, dissolved or otherwise 
suspended in a physiologically acceptable carrier, vehicle, solvent or excipient. 

In an eighth aspect, the invention provides methods and kits for detecting, either 
qualitatively or quantitatively, KIM expression and/or KIM polypeptide. As disclosed 
herein, KIMs are upregulated by exposure of mammalian kidney (renal) tissue to insult or 

20 injury, particularly ischemia-reperfusion injury. Thus, any KIM (or combination or panel 
thereof) can be used as an indicator of tissue status (preferably renal tissue status), or of a 
change therein, including without limitation exposure to tissue injury, the occurrence or 
stage of an injury process, the occurrence or stage of a tissue repair or regeneration process, 
or the occurrence of an abnormal tissue response to injury or other stimulus, such as an 

25 autoimmune response or an abnormal proliferative response (e.g., a neoplastic response). 

In one embodiment, the invention provides a detection method and kit for 
diagnosis, prognosis, staging, or monitoring of renal injury or of renal disease or of the 
effectiveness of therapy therefor. Some specific embodiments involve detecting and/or 
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i 



measuring the concentration of one or more KIM polypeptides in serum, urine, or urine 
sediment of an individual (a mammal, preferably a human) afflicted with or at risk of 

i 

developing renal injury or an impairment of renal function. Some such embodiments 
involve the use of at least one KIM polypeptide, anti-KIM antibody or conjugate thereof as 
i elements of a reagent kit for immunoassay according to standard techniques. For present 
purposes, any KIM disclosed herein or identified as such herein can be used as a kit 
element, along with other elements for KIM detection. In other instances, such as where 
the detected KIM has biological or enzymatic activity, the kit can include reagents for 

, i 

detecting KIM activity, e.g., by enzyme assay. Other specific embodiments involve 
10 detecting and/or measuring the level of expression of one or more KIM-encoding nucleic 
acids in renal cells in a kidney biopsy, or in cells shed into urine or urine sediment of an 
individual afflicted with or at risk of developing renal injury or an impairment of renal 
function. Some such, embodiments involve the use of a nucleic acid (e.g., RNA or DNA) , 
encoding a KIM polypeptide, or a unique fragment thereof, or a probe nucleic acid capable 

i 

15 of hybridizing to nucleic acid encoding a KIM, as elements of a reagent kit for 

hybridization assay according to standard techniques. As above, any KIM nucleic acid 
disclosed herein or appreciated herein as such can be used as a hybridization kit element, 
along with other reagents for detection of hybridized nucleic acids. 

In another embodiment, the invention provides a method and kit for diagnosis, 
20 prognosis, staging or monitoring of an abnormal response of renal tissue to tissue injury or 
other stimulus, including an autoimmune response or an abnormal proliferative response, 
such as neoplasia arising from or affecting renal tissue. Some specific embodiments 
involve detecting and/or measuring the concentration of one or more KIM polypeptides in 
serum, urine, or urine sediment of an individual (a mammal, preferably a human) afflicted 
25 with or at risk of developing renal injury or an impairment of renal function. Some such 
embodiments involve the use of at least one KIM polypeptide, anti-KIM antibody or 
conjugate thereof as elements of a reagent kit for immunoassay according to standard 
techniques. For present purposes, any KIM disclosed herein or identified as such herein can 
be used as a kit element, along with other elements for KIM detection. In other instances, 
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such as where the detected KIM has biological or enzymatic activity, the kit can include 

i 

i 

reagents for detecting KIM activity, e.g., by enzyme assay. Other specific embodiments 
involve detecting and/or measuring the level of expression of one or more KIM-encoding 
nucleic acids in cells present in a kidney biopsy, or in cells shed into urine or urine 

5 sediment of an individual afflicted with or at risk of developing renal injury or an 

impairment of renal function. Some such embodiments involve the use of a nucleic acid 
(e.g., RNA or DNA) encoding a KIM polypeptide, or a unique fragment thereof, or a probe 
r nucleic acid capable of hybridizing to nucleic acid encoding a KIM, as elements of a 
reagent kit for hybridization assay according to standard techniques. As above, any KIM 

10 nucleic acid disclosed herein or appreciated herein as such can be used as a hybridization 
kit element, along with other reagents for detection of hybridized nucleic acids. 

In a ninth aspect, the invention provides a method and reagent for imaging tissues, 
either in vitro or in vivo. In particular, the invention provides a method and reagent for 
imaging the presence, extent or severity of kidney injury or of kidney tissue repair or 

15 regeneration. Similarly, the invention provides a method and reagent for imaging the 
presence, extent, severity or stage of an abnormal response to tissue injury or other 
stimulus, such as autoimmunity or neoplasia, particularly neoplasia arising from or 
affecting renal tissue. The present invention also provides a method for targetting an 
imageable compound to cells or tissue expressing or producing a KIM. The present 

20 method involves the step of contacting cells or tissue with a detectable (e.g., imageable) 
KIM-binding reagent, which in some embodiments is an anti-KIM antibody or conjugate 
thereof, or a KIM fusion protein. In other embodiments, the detectable KIM-binding 
reagent is a nucleic acid of the invention (e.g., a probe or antisense nucleic acid) labeled 
with a radionuclide or other imageable compound. According to the present method, cells 

25 expressing or producing a KIM are visualized (imaged) by detecting the presence and/or 
location of an accumulation of the KIM-binding reagent. For imaging in vivo, the KIM- 
binding reagent is administered, by any appropriate route, to an individual (a mammal, 
preferably a human) suspected of harboring an imageable locus of KIM expression and/or 
production. Without being limited hereby, it is believed that the present method can be 
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used to detect the presence and/or location of a tissue mass (e.g., a tumor) abnormally 
producing or expressing a KIM. 

In a tenth aspect, the invention provides methods for treating an individual (a 
mammal, preferably a human) afflicted with or at risk of developing a disease or condition 
5 contributed to or associated with a dysfunction or dysregulation of a KIM gene or protein. 
Further, the invention provides methods for treating an individual afflicted with or at risk 
of developing a disease or condition beneficially affected by therapeutic administration of a 
KIM protein. The present methods involve the step of administering to the individual a 
therapeutically effective amount of a KIM polypeptide, variant or fusion protein thereof, or, 
10 conversely, of an anti-KIM antibody. It is expected that such compounds will be useful in 
therapeutic methods which manipulate, e.g., stimulate or inhibit, biological responses that 
are dependent on KIM function. 

! 

In an eleventh aspect, the invention provides a method for inhibiting the growth of 

i 

KIM-expressing tumor cells, involving the step of contacting the cells with an anti-KIM 

15 antibody, conjugated to a toxin or radionuclide. In an alternative embodiment, the method 
involves the step of contacting the cells with an antisense KIM nucleic acid that is 
sufficient to suppress or disrupt expression of a KIM gene in the tumor cells. 

In a twelfth aspect, the invention provides a method of gene therapy. The present 
method involves the administration of a vector capable of directing the production of a 
20 KIM, to an individual afflicted with or at risk of a renal disorder, disease or injury. As a 
result of the present method, growth of new tissue, preferably renal tissue, is stimulated, or 
survival of existing tissue, preferably renal tissue, is promoted. 

The foregoing and other objects, features, aspects and advantages of the present 
invention, as well as the invention itself, will be more fully understood from the following 
25 description of preferred embodiments. 



WO 98/53071 PCT/US98/10547 

Detailed Description of the Invention 

i 

The discovery, reported herein, of a novel class of Kidney Injury-related Molecules 

r 

(KIMs) was made by analyzing differences in mRNA expression between normal adult 
mammalian kidneys, and kidneys in the process of regenerating following exposure to 

5 tissue insult (specifically, to ischemia-reperfusion injury). Two established techniques 
^ were used for this purpose: representational difference analysis (RDA), and suppression 
subtractive hybridization (SSH). Both techniques were used to assess cDNAs isolated 
from various timepoints, e.g., 48 hours, after the onset of ischemia. In these studies, the 
normal adult kidney material was isolated from sham-operated subjects. Both techniques 

10 resulted in the depletion of cDNAs which are common to both postischemic and to normal 
kidney samples, leaving a pool of cDNAs which are significantly expressed only in injured 

or regenerating kidney tissue. This pool likely contains cDNAs corresponding to genes 

1 i 

that encode proteins involved in the injury process, and proteins involved in tissue repair or 
regeneration processes. Therefore, cDNAs isolated from the pool likely encode parts or all 
15 of proteins likely to be therapeutically beneficial for treatment or prophylaxis of tissue 
injury, especially renal injury. Several cDNA clones have been obtained, sequenced and 
characterized. 

Selected Definitions 

A "KIM protein", herein used synonymously with "KIM", is any protein or 

20 polypeptide encoded by mRNA which is selectively upregulated following injury to a 
kidney. One group of KIM proteins of interest includes those coded for by mRNA which 
is selectively upregulated at any time within one week following any insult which results in 
injury to renal tissue. Examples of times at which such upregulation might be identified 
include 10 hours, 24 hours, 48 hours or 96 hours following an insult. Examples of tissue 

25 insults include toxin exposure, hypoxia, hyperoxia, hemodynamic disruption, ischemia, 
reperfusion, or mechanical compression. Many different types of proteins fall within the 
KIM class, including cell surface proteins (e.g., ligands or counter-receptors involved in 
cell-cell or cell-matrix interactions), secreted proteins (e.g., diffusable biological response 
modifiers, such as growth factors, differentiation factors, survival factors and the like), 
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intracellular proteins (e.g., elements of a signalling pathway), and nuclear proteins (e.g., 
transcription factors). 

A "KIM ligand" is any molecule which noncbvalently and specifically binds to a 
KIM protein. Such a ligand can be a protein, peptide, steroid, antibody, amino acid 

1 5 derivative, or other type molecule, in any form, including naturally-occurring, 

recombinant^ produced, or otherwise synthetic. A KIM ligand can be in any form, 
including soluble, membrane-bound, or part of a fusion construct with immunoglobulin, 
fatty acid, or other moieties. The KIM ligand may be an integrin. A membrane-bound 
KIM ligand can act as a receptor which, when bound to or associated with KIM, triggers a 

10 cellular response. In some interactions, a KIM may associate with a plurality of KIM 
ligands, or may associate with a KIM ligand as part of a complex with one or more other 

molecules or cof actors. In a situation where both the KIM and the KIM ligand are bound 

i 

to cell membranes, i the KIM may associate and react with KIM ligand which is bound to 
the same cell as the KIM, or it may associate and react with KIM ligand be bound to a 
15 second cell. Where the KIM ligation occurs between molecules bound to different cells, the 
two cells may be the same or different with respect to cellular type or origin, phenotypic or 
metabolic condition, or type or degree of cellular response (e.g., growth, differentiation or 
apoptosis) to a given stimulus. "KIM ligation*' refers to the contact and binding of KIM 
with a KIM ligand. 

20 A "unique fragment" of a nucleic acid means any fragment of sufficient length to 

have a sequence likely to be substantially unique in a mammalian genome. Thus, a unique 
fragment generally means an oligonucleotide at least 16 nucleotide bases in length. 
Similarly, a "unique fragment" of a polypeptide means any fragment of sufficient length to 
have an amino acid sequence likely to be substantially unique to a given mammalian 

25 protein, such as a KIM. Thus, a unique peptide fragment generally means a peptide at least 
7 amino acids in length. 

A "KIM variant" means a KIM whose sequence differs from a sequence disclosed 
herein by the presence of one or more internal or terminal insertions, deletions or 
substitutions of a nucleotide (when referring to KIM nucleic acids) or of an amino acid 
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WO 98/53071 



PCT/US98/10547 



(when referring to KIM polypeptides). Preferably, the KIM variant is "substantially 
similar" to the corresponding disclosed KIM, or to a unique fragment thereof. That is, the 
KIM variant is "homologous" to the corresponding disclosed KIM. "Substantially 
similar" or "homologous" variants are structurally similar to the corresponding disclosed 

5 KIM. Further, "substantially similar" or "homologous" variants have sufficient functional 
similarity to the corresponding disclosed KIM that they share one or more of the KIM's 
biological properties or functions (e.g., binding to a receptor or ligand, triggering of 
biological responses, transport of a metabolite, catalysis of a substrate, or the like). A KIM 
variant can be naturally occurring or synthesized or produced by routine techniques, such 

10 as molecular engineering techniques. 

By "alignment of sequences" is meant the positioning of one sequence, either 
nucleotide or amino acid, with that of another, to allow a comparison of the sequence of 

relevant portions of one with that of the other. Generally, sequences are aligned using the 

i 

GAP and BESTFTT programs, which are based on the teachings of Needleman et al. (J. 

15 Mol. Biol. 48:443-453, 1970), Smith et al. (Adv. Appl. Math. 2!482-489, 1981), and 
Rechild et al. (CABIOS 5: 107-1 13, 1989). Generally, when a homologous variant of a 
KIM is aligned with the corresponding KIM polypeptide, it will share amino acids that 
contribute to the KIM's three dimensional structure, such as cysteine residues. Although 
the relative positions of cysteine residues is generally conserved in the variant sequence, 

20 homologous or functionally equivalent sequences can include functionally equivalent 

arrangements of the cysteines, including arrangements comprising amino acid insertions or 
deletions which alter the linear arrangement of the cysteines, but do not materially impair 
their relationship or ability to form disulfide bonds in the folded structure of the KIM 
protein. Therefore, minor internal gaps and amino acid insertions, such as occur in splice 

25 variants, are ignored when aligning sequences herein. 

"Sequence homology", "percent (%) homology", "sequence similarity" and 
"percent (%) similarity" are used interchangeably herein, and refer to the sum of the 
percentage of residues (whether nucleotides or amino acids) that are, when aligned with a 
reference sequence, the same as the corresponding reference residues, and those that are, 
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when aligned, conservative substitutions for the corresponding reference residues. 

"Sequence identity" and "percent (%) identity" also are used interchangeably herein, and 
refer to the percentage of residues (whether nucleotides or amino acids) that are, when 
aligned with a reference sequence, the same as the corresponding reference residues. Both 
5 sets of terms are used according to their definitions in Altschul et aL (1990), J. Mol. Biol. 
215:403-410 and in the Basic Local Alignment Search Tool (BLAST) algorithm described 
therein. For present purposes, the algorithm gap weight is set at 3.0 and the length weight 
is set at 0.1- 

i 

"Hybridization" means the formation of a duplex nucleic acid molecule, in which 
10 nucleotide bases of a first polynucleotide strand bind noncovalently with cognate 

nucleotide bases of a second polynucleotide strand. Generally, such noncovalent binding 

i 

occurrs in DNA only between adenosine (A) and thymidine (T) bases, and guanosine (G) 
and cytosine (C) bases. In RNA, binding occurs only between A and uracil (U), and G and 
C. The apposition of non-cognate bases in a duple* nucleic acid molecule, e.g., A with C, 

15 is termed a "mismatch" pairing. Under high stringency hybridization conditions, 
mismatches occur rarely in a given duplex nucleic acid molecule. Low stringency 
conditions permit the occurrence of some mismatches. Exemplary conditions which 
promote DNA hybridization, termed "hybridization conditions", include 6.0X sodium 
chloride/sodium citrate (SSC) at about 45°C, followed by a wash of 2.0X SSC at 50°C 

20 The salt concentration in the wash step can be selected from a low stringency wash of 

about 2.0X SSC at 50°C, to a high stringency was of about 0.2X SSC at 50°C. In addition, 
the temperature in the wash step can be selected from a low stringency wash at room 
temperature, about 22°C, to a high stringency wash at about 65°C. 

A "KIM agonist" is a molecule which can specifically trigger a cellular response 
25 normally triggered by the interaction of KIM with a KIM ligand. A KIM agonist can be a 
KIM variant, or a specific antibody to KIM, or a soluble form of the KIM ligand. 

A "KIM antagonist" is a molecule which can specifically associate with a KIM 
ligand or a KIM, thereby blocking or otherwise inhibiting KIM binding to the KIM ligand. 
The antagonist binding blocks or inhibits cellular responses which would otherwise be 
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triggered by ligation of the KIM ligand with KIM or with a KIM agonist. Examples of 
KIM antagonists include certain KIM variants, KIM fusion proteins and specific antibodies 
to a KIM ligand or KIM. 

"Isolated" refers to a condition in which a nucleic acid or polypeptide of the present 
i invention is essentially free of other nucleic acids, polypeptides, or of other contaminants 
1 with which it might normally be found in nature, and as such exists in a form not found in 
nature. 

"Substantially pure" refers to a condition in which a nucleic acid or polypeptide of 
the present invention is separated from other nucleic acids, polypeptides, or other 

i 

10 contaminants, particularly naturally occurring contaminants, that interfere with the ability 

to detect, visualize or isolate the the nucleic acid or polypeptide of the invention, or that 

i 

interfere with a biological function or property thereof. A substantially pure nucleic acid or 
polypeptide of the invention is not generally found in nature. 

A "chemical derivative" of another molecule contains one or more additional 
15 chemical or biochemical moieties not found naturally in association with the molecule. 
Such moieties may iiriprove the molecule's solubility, absorption, biological half life, etc. 
The moieties may alternatively decrease the toxicity of the molecule, eliminate or attenuate 
any undesirable side effect of the molecule, etc. Moieties capable of mediating such effects 
are disclosed, for example, in Remington's Pharmaceutical Sciences. 16th ed„ Mack 
20 Publishing Co., Easton, Penn. ( 1980). 

Other terms used herein, including "antisense DNA", "antisense probe", "cloning", 
"cDNA", "cDNA library", "DNA polymorphism", "expression", "gene", "hybridoma", 
"plasmid", "probe", "labeled", "recombinant", "host cells", "transformed", "transfected", 
"vector", and the like are used in the sense of their art-recognized meanings in the fields of 
25 cell biology, cell culture, molecular biology, transgenic biology, microbiology, 
recombinant DNA or genetic engineering, and immunology. Such meanings are 
determined by consultation of one or more of the following widely available texts: 
Molecular Cloning. A Laboratory Manual . 2nd Ed. (Sambrook, Fritsch and Maniatis, eds.), 
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Cold Spring Harbor Laboratory Press, 1989; DNA Cloning . Volumes I and II (Glover, ed.), 
1985; Oligonucleotide Synthesis (Gait, ed.), 1984; U.S. Patent 4,683,195, Mullis et al., 
invs.; Nucleic Acid Hybridization (Hames & Higgins, eds.), 1984; Transcription and 
Translation (Hames & Higgins, eds.), 1984; Culture of Animal Cells (Freshney), Alan R. 

5 Liss, publ., 1987; Immobilized Cells and Enzvmes (IRL Press), 1986; A Practical Guide to 
Molecular Cloning (Perbal), 1984; Current Protocols in Molecular Biology . Wiley & Sons, 
publ., 1989; Methods in Enzvmologv . Academic Press, New York NY (especially Volumes 
154 and 155); Gene Transfer Vectors for Mammalian' Cells (Miller and Calos, eds.), Cold 
Spring Harbor Laboratory Press, 1987; Immunochemical Methods in Cell and Molecular 

10 Biology (Mayer and Walker, eds.), Academic Press, London, 1987; Handbook of 
Experimental Immunology . Volumes I-IV (Weir and filackwell, eds.), 1986; and, 
Manipulating the Mouse Embrvo . Cold Spring Harbor Laboratory Press, 198(5. 

! 

Compounds of the Invention 

i 

Specific, novel KIMs discovered through RDA or SSH techniques are set forth 
15 below in TABLE 1 and in the Sequence Listing. TABLE 1 lists KIMs for which potential 
full length (PFL) cDNA clones were obtained. Also shown are the predicted polypeptide 
sequences encoded in the open reading frames (ORFs) of many of the newly discovered 
KIMs. In addition, TABLE 1 provides, where relevant, the subtracted amplified clone 
(SAC) designations and the PFL location(s) of partial sequences, many of which were 
20 reported in prior U.S. Provisional S.N. 60/047,490 and 60/047,49 1 . 
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TABLE 1, continued 



SEQ. NO. 


PFL 


ALII* 

CLONE , 
DESIGNATION 


OPEN READING FRAME (OFR) 
LOCATION SEQ. ID # 


# 80 


HW090 
HW118 


SAC_24197 


335 . 937 


# 81 


# 82 


HW092 


SAC_24320 






# 83 


HW093 


SAC 24538 


8 . . 1357 


# 84 


# 85 


HW094 


SAC_23896 


255.. 1238 


# 86 


# 87 


HW095 


SAC 23802 


. 7 . . 876 


# 88 


# 89 


HW096 


' NONE 


295 . . 1302 


# 90 



1 J ' 

Additional KIMs, which are believed to be related to (e.g., substantially similar to) 
known molecules also were obtained through RDA or SSH techniques, and are set forth in 
TABLE 2 and in the Sequence Listing. It is believed that few to none of these molecules 

were previously appreciated to be KIMs, i.e., to have relevance to tissue injury or repair 

1 

processes, particularly in the kidney. TABLE 2 lists these KIMs along with their clone 
designations. Partial sequences of some of the cDNA clones listed below were also set 
forth in the priority document. Also shown are the name(s) and GENB ANK accession 
numbers of the known genes to which these KIMs may be related. 



TABLE 2 



SEQ1D# 


DESIGNATION 


RELATED GENE 


Accession # 


92 


23798 


Rat alpha 1 A (10) adrenergic 
receptor 


M60654 


93 


23800 


Rat N27 


V30789 


94 


23801 


Rat alpha-2mu globulin-related 
protein 


X13295 


95 


23833 


Rat contrapsin-like protease 
inhibitor related protein 


X16359 


96 


23836 


Rat Fit-1 


V04319 


97 


23853 


Rat tumor-associated antigen 
(oE4) 


L12025 


98 


23906 


Rat sulfated glycoprotein 2 


X13231 


99 


23907 


Mouse secreted protein (p85) 


L33416 


100 


23916 


Human Differentiation-dependent 
A4 protein 


L09604 
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TABLE 2, continued 



SEQ 1D# 


DESIGNATION 


RELATED GENE 


A cess) n # 


101 


23922 


Rat manganese-containing 
superoxide dismutas 


Y00497 


102 


24030 


Rat heme oxygenase 


J02722 


103 


24078 


Rat tissue inhibitor of 
metailoprotein 
ase 1 (TIMP1) 


U16022 


104 


24105 


Rat alpha 2- macroglobulin 


J02635 


105 


24140 


Rat Cathepsin D 


X54467 


106 


24142 


Rat lysozyme 


L12458 


107 


24192 


Mouse MAMA mRNA 

i 


X67809 


108 


24296 


Rat transketolase 


U09256 


109 


24445 


Rat heat stable antigen CD24/ 
Elongation factor 1 


Z11531 


110 


24527 


Cathepsin L 


S85184 


111 


24540 


Mouse endothelial monocyte- 
activating polypeptide 1 


U41341 


112 


24623 


Rat spleen thymosin beta4 


K01334 


1 


HW010 


Human semaphorin (CD 100) 


U60800 


11 


HW016 


Mouse ribonucleotide reductase 
M2 subunit 


M 14223 




HW032 


Human S100C 


D49355 


24 


HW038 


Human splicing factor SF3a120 


X85237 


25 


HW039 


Annexin 1 1 


MB2802 


41 


HW049 


Mouse u22 snoRNA host gene 
(UHQ) 


U40654 


48 


HW054 


M^fe * |£ 1 1 & I'd it 

Rat myeltn oligodendncyte protein 
(MOG) 


M99485 


59 


HW062 


Mouse proliferation-associated 
protein 1 


U43918 


60 


HW066 


Human cleavage stimulation 
factor SOkd subunit 


L02547 


63 


HW070 


Human BRCA2 region mRNA 
sequence CG037 


U50523 
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TABLE 2, continued 



SEQ ID # 


DESIGNATION 


RELATED GENE 


Accession # 


70 


HW078 


Human G protein gamma-10 
subunit 


P50151 


74 


HW083 


Rat ins-1 winged helix 


P97691 


91 


HW097 


Human Na+ channel 2 


U781B1 



Any of the foregoing KIM cDNAs can be inserted into a vector, liposome or other 
carrier vehicle for internalization and production in a host cell. Furthermore, the invention 
5 encompasses derivatives and variants of each of the foregoing KLMs as listed in TABLES 1 
and 2, 

i 

One embodiment of the invention provides soluble variants of a KIM protein that is 
usually synthesized as a membrane associated protein. Soluble variants lack at least a 
portion of the transmembrane or intra-membrane section of a native KIM protein. In some 

10 examples, the soluble variant lacks the entire transmembrane or intra-membrane section of 

i i 

a native KIM protein. Soluble variants include fusion proteins which encompass 
derivatives of KIM proteins that lack at least a portion of the transmembrane or intra- 
membrane section of a native KIM protein. AH types of KIM fusion proteins are included, 
particularly those which incorporate his-tag, Ig-tag, and myc-tag forms of the molecule. 
15 These KIM fusions may have characteristics which are therapeutically advantageous, such 
as the increased half-life conferred by the Ig-tag. Also included are fusion proteins which 
incorporate portions of selected domains of the KIM protein. 

Variants can differ from a naturally occurring KIM protein in amino acid sequence 
or in ways that do not involve sequence, or both. Variants in amino acid sequence are 

20 produced when one or more amino acids in naturally occurring KIM protein is substituted 
with a different natural amino acid, an amino acid derivative or non-natural amino acid. 
Particularly preferred substitution variants include naturally occurring KIM proteins, or 
biologically active unique fragments thereof, whose sequences differ from the wild type 
sequence by one or more conservative amino acid substitutions, which typically have 

25 minimal influence on the secondary structure and hydrophobic nature of the protein or 
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peptide. Conservative substitutions typically include the substitution of one amino acid for 
another with similar characteristics, such as substitutions within the following groups: 
valine, glycine; glycine, alanine; valine, isoleucine; aspartic acid, glutamic acid; 
asparagine, glutamine; serine, threonine; lysine, arginine; and phenylalanine, tyrosine. 
n5 Other conservative substitutions are those which meet the criteria for an "accepted point 
^ mutation" as defined in the Atlas of Protein Sequence and Structure (Dayhoff et al., eds.), 
1978. See also PCT publication no. WO97/44460: 

Substitution variants can also have sequences which differ by one or more non- 

i i 

conservative amiho acid substitutions, provided however that the substitution does not 
10 abolish the native KIM protein's biological activity, ligand- or receptor-binding 

characteristics, or other functional property of interest. Exemplary non-conservative 
substitutions are those ,in which: (I) a hydrophilic residue, e.g., serine or threonine, is 
substituted for (or by) a hydrophobic residue, e.g., leucine, isoleucine, phenylalanine, or, 
alanine; (ii) a cysteine residue is substituted for (or by) any other residue; (iii) a residue 
15 having an electropositive side chain, e.g., lysine, arginine or histidine, is substituted for (or 
by) a residue having an electronegative charge, e.g., glutamic acid or aspartic acid; or (iv) a 

i 

residue having a bulky side chain, e.g., phenylalanine, is substituted for (or by) one not 
having such a side chain, e.g., glycine. 

Other KIM valiants oi this invention include truncation variants ^comprising at 
20 least a unique fragment of the corresponding KIM), insertion variants, and splice variants. 

Other KIM variants within the invention are those with modifications which 
increase polypeptide stability. Such variants can contain, for example, one or more non- 
peptide bonds (which replace the peptide bonds) in the peptide sequence. Also included 
are: variants that include residues other than naturally occurring L-amino acids, such as D- 
25 amino acids or non-naturally occurring or synthetic amino acids such as beta or gamma 
amino acids and cyclic variants. Incorporation of D- instead of L-amino acids into the 
polypeptide may increase its resistance to proteases. See, e.g., U.S. Patent 5,219,990. 
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KIM variants can be naturally-occurring, or produced through synthetic or 
molecular engineering techniques. Those of skill in the art will understand and appreciate 
that an engineered KIM variant can provide advantageous properties, e.g., in facilitating 
purification, improving stability, modulating a biological function, or the like. Thus, in 
5 some instances, a KIM variant will be desired that lacks a glycosylation site, or that has 

i 

decreased aggregation potential due to elimination of a hydrophobic surface, or that has 
improved folding efficiency due to elimination of a cysteine residue, or the like. 

In general, ,KIM variants have at least fifty (50) % amino acid sequence homology 
or similarity with the corresponding KIM protein. Preferably, the variants have at least 

i 

10 sixty-five (65) % sequence similarity, more preferably at least eighty (80) % similarity with 
the corresponding KIM protein. Still more preferably, the sequence similarity is at least 

ninety (90) %, or most preferably, at least ninety-five (95) %. Other preferred KIM 

i 

variants are those which 1) share at least forty (40) % similarity to the corresponding KIM 
protein, and 2) share at least eighty (80) % of aligned cysteine residues with the 

i 

15 corresponding KIM protein. 

Just as it is possible to replace substituents of the amino acid backbone or scaffold, 
it is also possible to replace, modify or add non-amino acid moieties to the scaffold. Such 
moieties can occur naturally in a given KIM as a result of post-translational processing, 
including acetylation, methylation, phosphorylation, carboxylation or glycosylation. As 
20 desired, such moieties can be removed of added by conventional synthetic or biochemical 
techniques. Further, non-natural moieties can be added to produce a derivative of a KIM or 
KIM variant of this invention. For example, polyethylene glycol (PEG) can be linked to a 
KIM to improve its stability or pharmacokinetic properties. 

As will be readily appreciated, anti-KIM and anti-KIM variant antibodies can be 
25 produced by conventional techniques. Specifically contemplated are polyclonal and 
monoclonal antibodies, including antigen-binding fragments thereof and engineered 
derivatives thereof. Antigen-binding fragments of intact antibodies include complete Fab 
fragments, F(ab T )2 compounds, V H regions, and Fv regions. Engineered derivatives of 
antibodies of the invention include single chain antibodies (see, e.g., WO 96/23071), as 
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well as human, humanized, primatized, or chimeric antibodies (see, e.g., PCT/US 
95/16709). Engineered derivatives of the present antibodies are produced, generally, using 
standard recombinant DNA techniques (Winter and Milstein, Nature 349: 293-99, 1991), 
These include "chimeric" antibodies, in which the antigen binding domain from an animal 
5 antibody is linked to a human constant domain. In this instance, an antibody with the 
desired KIM-binding selectivity is derived initially from a nonhuman mammal (e.g., a 
mouse, rat or hamster), and subjected to recombinant DNA manipulation to replace all or 
part of the hinge and constant regions of the heavy chain and/or the constant region of the 
light chain, with corresponding regions from a human immunoglobin light chain or heavy 

i 

10 chain. (See, e.g., Cabilly et al„ United States Pat. No. 4,816,567; Morrison et aL, Proc. 
Natl. Acad. Sci. 81: 6851-55, 1984). Chimeric antibodies reduce the immunogenic 

responses elicited by animal antibodies when used in human clinical treatments. 

i 

In addition, recombinant "humanized" antibodies can be synthesized. Humanized 
antibodies are antibodies initially derived from a nonhuman mammal in which recombinant 

15 DNA technology has been used to substitute some or all of the amino acids not required for 
antigen binding with amino acids from corresponding regions of a human immunoglobin 
light or heavy chain. That is, they are chimeras comprising mostly human immunoglobulin 
sequences into which the regions responsible for specific antigen-binding have been 
inserted (see, e.g., PCT patent application WO 94/04679). Animals are immunized with 

20 the desired antigen, the corresponding antibodies are isolated and the portion of the 
variable region sequences responsible for specific antigen binding are removed. The 
animal-derived antigen binding regions are then cloned into the appropriate position of the 
human antibody genes in which the antigen binding regions have been deleted. Humanized 
antibodies minimize the use of heterologous (inter-species) sequences in antibodies for use 

25 in human therapies, and are less likely to elicit unwanted immune responses. Primatized 
antibodies can be produced similarly. 

If desired, fully human antibodies with KIM-binding specificity which can be 
produced in nonhuman animals, such as transgenic animals harboring one or more human 
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immunoglobulin transgenes. Such animals may be used as a source for splenocytes for 

producing hybridomas, as is described in U.S. 5,569,825. 

t 

Identification of Specific KIMs of the Invention 

i To assess the dynamic biological processes of response to injury and tissue repair, a 

kidney ischemia-reperfusion model, which similates acute renal failure, was employed. 
The results of studies in this model system now have been published, and appear in 
Ichimura et al. (1998) . J. Biol. Chem. 273:4135-4142, the teachings of which are 
incorporated by reference herein. In this model, the kidney has the capacity for cell 
renewal (both structurally and funcitonally) after injury to tubular epithelial cells. The 

10 nephron is damaged functionally by an ischemic reperfusion injury that results in regional 
areas of proximal tubule cell death. During the repair process, the kidney proximal tubule 

epithelium undergoes a complex series of events including (1) cell death and cast 

i , 1 

formation in the tubule lumen (casts are aggregates of dead, semiviable and viable cells, as 

> i i 

well as of cell debris); (2) proliferation of surviving proximal tubule epithelial cells; (3) 

15 formation of a poorly differentiated regenerative epithelium over the denuded basement 
membrane (this simplified epithelium expresses vimentin, a mesenchymal marker); and (4) 
differentiation of the regenerative epithelium to form fully functional proximal tubule 
epithelial cells. Gene expression analysis of the kidney at various timepoints following the 
onset of ischemia revealed the upregulation of many KIMs anticipated to be involved in the 

20 injury and repair/regeneration processes. 

1, Generation of ischemic and normal rat adult kidneys 

Ischemic injured rat kidneys are generated as described by Witzgall et al. (J. Clin 
Invest. 93: 2175-2188, 1994). Briefly, the renal artery and vein from one kidney of an 
adult Sprague-Dawley rat are clamped for 40 minutes and then reperfused. Injured 
25 kidneys are harvested from the rats at 24 hours and at 48 hours after reperfusion. Kidneys 
from sham-operated, normal adult Sprague-Dawley rats are also harvested. 

2. mRNA isolation 
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Post-ischemic rat kidneys arc prepared, as described by Witzgall et al. (J. Clin. 
Invest. 93: 2175-2188, 1994). Briefly, the renal artery and vein of the left kidney of an 
adult Sprague Dawley rat are clamped for 40 minutes and then reperfused. The ischemic 
kidneys are removed from rats either 24 hours or 48 hours after reperfusion. Normal rat 
5 kidneys are used as controls. 

Total RNA is isolated with Trizol reagent (BRL catalog No. 1559-026). The kidney 
capsules are removed, then the kidneys are rinsed in PBS and immediately homogenized in 
Trizol solution. In some instances, the kidney is chopped into several pieces in PBS, frozen 
quickly in liquid nitrogen and kept at -70°C until it is processed for total RNA isolation. 
10 The mRNA is purified from total RNA according to the manufacturer's instructions, using 
an mRNA purification kit (Pharmacia, Catalog No. 27-9258-02). 

3(i). Representational difference analysis (RDA) method to clone short-fragment cDNAs 
up-regulated bv ischemia ( 

The RDA method of Hubank and Schatz (Nucleic Acid Research 22:5640-48, 
15 1994) is performed as described, with modifications as follow. Double-stranded cDNA is 
synthesized using the Superscript Choice® system (BRL Catalog No. 18090), from mRNA 
isolated from rat kidney either 24 hours or 48 hours post-ischemia as a tester and normal 
rat kidney as a driver. The tester and driver cDNA is digested with DpnII and ligated to R- 
Bgl-12/24 oligonucleotides. The adapter-ligated cDNA is amplified by PGR, and the PCR 
20 product is digested with DpnII to remove the oligonucleotides. The tester cDNA is ligated 
to J-Bgl- 12-24 oligonucleotides. 

Full length cDNA clones for Kim-1 and NMB contained in a plasmid vector are 
provided by T. Ichimura. They are digested with Sau3Al t purified and used as quenching 
reagents. cDNA fragments, 2-3 containing a fragment of annexin II, and 3-8 containing a 
.5 fragment of alpha 2 macroglobulin (both provided by T. Ichimura) are amplified by PCR 
with R-Bgl-24 primer, digested with DpnII to remove the oligonucleotides and used as 
quenching reagents. Tester cDN A is hybridized to an excess driver cDNA including the 
four quenching reagents described above. After mung bean nuclease digestion. 
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hybridization mixture is amplified by PGR to enrich the cDNA fragments upregulated by 
ischemia. Each of the PCR steps is tested with a range of template DNA concentrations, 
and the concentration giving the best signal is chosen for the next steps. The DNA mixture 
containing tester and driver DNA for the subtractive hybridization step is washed twice 
5 with 70% EtOH by spinning for 3 minutes. 

The second round cDNA subtraction is performed by hybridizing the first round 
cDNA subtraction product ligated to N-Bgl- 12-24 oligonucleotides to an excess of the 
same driver cDNA used for the first round subtraction. The third round cDNA subtraction 
is performed by hybridizing the second round cDNA subtraction product ligated to J-Bgl- 

10 12/24 oligonucleotides to an excess of the same driver cDNA used for the first round of 

i 

cDNA subtraction at a ratio of 1: 14,000. The final PCR product after the third round of 
subtraction is digested with Dpnll to remove the oligonucleotides and ligated to pCR- 
Script SK(+) vector. This represents a selected cDNA library by RDA. 

! 

I 

3(ii). Suppression subtractive hybridization (SSH) method to clone short-fragment cDNAs 

i i 

15 up-regulated bv ischemia 

SSH is perfomed according to the manufacturer's instructions (Clontech, Catalog 
No. K 1804-1) with certain modifications. Briefly, double-stranded cDNA are synthesized 
from mRNA isolated from rat kidney 24 hours or 48 hours post-ischemia as a tester, and 
from nomal rat kidney as a driver with the Superscript Choice™ system for cDNA 

20 synthesis (BRL Catalog No. 18090). The cDNA is synthesized according to the 

manufacturer's instruction except that the first strand cDNA is synthesized at 42°C, cDNA 
is digested with RSAI. Tester cDNA after RS AI digestion is ligated to adaptors. For the 
first hybridization, tester cDNA with adaptors is hybridized with excess driver cDN A at 
68°C for 12 hours. For the second hybridization, the mixture is hybridized at 68°C for 22 

25 hours. The cDNA mixture after the second hybridization is initially amplified by PCR for 
27 cycles only. Then, a portion of the PCR product is re-amplified for 10 or 13 cycles. The 
final subtracted PCR product is digested by RSAI to remove the adaptor, and separated on 
a gel. The three bands and the remaining smear are cut out separately and cloned to pCR- 
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Script SK (+) vector (Stratagene, Catalog No. 21 1 188). This represents a selected cDNA 

i 

library by SSH. 

4. Isolation and Characterization of Subtracted Amplified cDNA Clones 

Colonies are randomly picked from the selected RDA and SSH cDNA libraries, and 

ii 

5 , plasmid DNA is isolated with Qiagen plasmid kit. Each cDNA clone, referred to as a 
Subtracted Amplified Clone (SAC) is sequenced with the vector primers. DNA sequences 
are checked against GeneBank/EMBL databases for homology by the program BLAST™. 

Southern blots are prepared with the initial PCR products generated from normal 
and injured rat kidneys. The blots are hybridized to the inserts isolated from the S ACs to 

i 

10 confirm which SACs are induced by ischemia. 

4. Isolation and Characterization of potential full length cDNA clones 

4.25 ug of polyA+ mRNA isolated from rat kidney 24 hours post-ischemia, which 
has been purified twice by oligo dT chromatography, is used to generate a cDNA library. In 
order to obtain double-stranded cDNA with EcoRI adaptors, a Superscript Choice® 

15 system for cDNA synthesis (BRL Catalog No. 18090) is used following the supplier's 
protocol, except that the cDNA synthesis reaction is carried out at 42 C. The DNA is 
ethanol precipitated, washed, resuspended in 5 ul H20, and ligated to 2 ug EcoRI digested 
and CIAP-treated Lambda ZAPII fStrategene catalog No. 236612). The ligated DNA is 
packaged and used to infect E. coli XLr 1 Blue MRF. The complexity of the library is 

20 3.4xl0 6 independent recombinant plaques. Another cDNA library is generated from 4 ug of 
polyA+ RNA from rat kidney 48 hours post-ischemia as described above. 

The two cDNA libraries described above are screened with inserts from the SAC 
clones. pBluescript plasmid vector containing cDNA inserts are excised from Lambda 
ZAPII vector by in vivo excision. Inserts from the longest cDNA clones, referred to as 
25 Potential Full Length (PFL) clones, are sequenced. DNA sequences are checked by the 
program BLAST™ against the GenBank/EMBL databases for DNA homology. Predicted 
protein coding regions (open reading frames, or ORFs) from some of the PFL clones also 
were used to search the GenBank/EMBL databases. 
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Diagnostic Uses of the Compounds of the Invention 

Anti-KIM antibodies of the invention, which specifically bind to a protein of the 

r 

i 

invention or a unique fragment thereof, are useful in several diagnostic methods. These 
agents can be labeled with detectable markers, such as fluoroscopically or radiographically 
5ii opaque substances, and administered to an individual to allow imaging of tissues which 
1,1 express KIM protein. The agents can also be bound to (conjugated to) substances, such as 
horseradish peroxidase, which can be used as immuriocytochemical stains to allow 

visualization of areas of KIM protein-positive cells on histological sections. A specific 

1 i - 

antibody could be used alone in this manner, and sites where it is bound can be visualized 
10 in a sandwich assay using an anti-immunoglobulin antibody which is itself bound to a 

1 » i 

detectable marker. 

Specific antibodies to KIM protein are also useful in immunoassays to measure 

■I 

KIM presence or concentration in samples of body tissues and fluids. Such concentrations 
may be correlated with different disease states. As an embodiment of particular interest, 

i 

15 the invention includes a method of diagnosing renal injury, or of monitoring a process of 
renal repair, by measuring the concentration of KIM or of KIM fragments in the urine, 
plasma or serum of a patient. Similarly, KIM can be measured in urine sediment, in 
particular in cellular debris in the urine sediment. Casts of renal tubule cells, which may be 

present in urine sediment from patients with ongoing renal disease, may contain elevated 
20 levels of KIM protein and mRNA. 

Specific antibodies to KIM protein may also be bound to solid supports, such as 
beads or dishes, and used to remove the ligand from a solution, either for measurement, or 
for purification and characterization of the protein or its attributes (such as post- 
radiational modifications). Such characterization of an individual's KIM protein is 
25 expected to be useful in identifying deleterious mutants or processing defects which 

interfere with KIM function and are associated with abnormal phenotypes. Each of these 
techniques is routine to those of skill in the immunological arts. 

Additional imaging methods utilize KIM or KIM fragments, fused to imageable 
moieties, for diagnostic imaging of tissues that express KIM ligands, particularly tumors. 
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Further diagnostic techniques are based on demonstration of upregulated KIM 
mRNA in tissues, as an indication of injury-related processes. (See, e.g., PCT publication 
no. WO 97/44460). 

Therapeutic Uses of the Compounds of the Invention 

5 The therapeutic methods of the invention involve selectively promoting or 

inhibiting cellular responses that are dependent on the presence or concentration of any 
KIM, including without limitation KIM ligation, KIM enzyme activity, KIM secretion, 
KIM signalling, and KIM modulation of gene expression* \ In instances where a KIM and a 
KIM ligand are both membrane bound, and expressed by different cells, the signal 

10 transduction may occur in the KIM-expressing cell, in the KIM ligand-expressing cell, or 
in both* 

! 

KIM ligation-triggered response in a KIM ligand-expitssing cell may be generated 

by contacting the cell with exogenous KIM, KIM fusion proteins or activating antibodies 

i 

against KIM ligand, either in vitro or in vivo. Further, responses of the KIM ligand- 
15 expressing cell that would otherwise be triggered by endogenous KIM could be blocked by 
contacting the KIM ligand-expressing cell with a KIM ligand antagonist (e.g., an antagonist 
antibody that binds to KIM ligand), or by contacting the endogenous KIM with an anti- 
KIM antibody or other KIM-binding molecule which prevents the effective ligation of KIM 
with a KM ligand. 

20 Similarly, the responses triggered by KIM ligation in the KIM-expressing cell may 

be promoted or inhibited with exogenous compounds. For example, KIM ligation- 
triggered response in a KIM-expressing cell may be generated by contacting the cell with a 
soluble KIM ligand, or certain anti-KIM activating antibodies. Further, responses of the 
KIM-expressing cell that would otherwise be triggered by interaction with endogenous 

25 KIM ligand could be blocked by contacting the KIM-expressing cell with an antagonist to 
KIM (e.g.., a blocking antibody that binds to KIM in a manner that prevents effective, 
signal-generating KIM ligation), or by contacting the endogenous KIM ligand with an anti- 
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KIM ligand antibody or other KIM ligand-binding molecule which prevents the effective 
ligation of KIM with the KIM ligand. 

Which of the interventions described above are useful for particular therapeutic 
uses depend on the relevant etiologic mechanism of either the pathologic process to be 
5 inhibited, or of the medically desirable process to be promoted, as is apparent to those of 
skill in the medical arts. For example, where KIM ligation results in desirable cellular 

r 

growth, maintenance of differentiated phenotype, resistance to apoptosis induced by 
various insults, or other medically advantageous responses, one of the above-described 
interventions that promote ligation-triggered response may be employed. In the alternative, 

10 one of the inhibitory interventions may be useful where KIM ligation invokes undesirable 

i 

consequences, such as neoplastic growth, deleterious loss of cellular function, 
susceptibility to apoptosis, or promotion of inflammation events. 

Following are examples of the previously described therapeutic methods of the 
invention. One therapeutic use of the KIM-related compounds of the invention is for 

15 treating a subject with renal disease, promoting growth of new tissue in a subject, or 

promoting survival of damaged tissue in a subject, and includes the step of administering 
to the subject a therapeutically effective amount of a KIM protein of the invention, or of a 
pharmaceutical composition which includes a protein of the invention. The protein used in 
these methods can be a unique fragment of a full-length KIM protein, a soluble KIM 

20 variant or a soluble ligand thereof, a KIM fusion protein, or a KIM agonist. These 

methods can also be practiced by administering to the subject a therapeutically effective 
amount of an agonist antibody of the invention, or a pharmaceutical composition which 
includes an agonist antibody of the invention. A KIM protein can be administered 
concurrently with a therapeutically effective amount of a second compound which exerts a 

25 medically desirable adjunct effect. While tissues of interest for these methods include any 
tissue, preferred tissues include renal tissue, liver, neural tissue, heart, stomach, small 
intestine, spinal cord, or lung. Particular renal conditions which are expected to be 
beneficially treated with the compounds of the invention include acute renal failure, acute 
nephritis, chronic renal failure, nephrotic syndrome, renal tubule defects, kidney 
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transplants, toxic injury, hypoxic injury, and trauma. Renal tubule defects include those of 

i 

either hereditary or acquired nature, such as polycystic renal disease, medullary cystic 
disease, and medullary sponge kidney. This list is not limited, and may include many other 
renal disorders (see, e.g., Harrison's Principles of Internal Medicine, 13th ed., 1994, which 
5,t is herein incorporated by reference.) The subject of the methods may be human. 

^ A therapeutic intervention for inhibiting growth of undesirable, KIM- or KIM- 

ligand-expressing tissue in a subject includes the step of administering to the subject a 
therapeutically effective amount of a KIM antagonist (e.g.., an antibody blocks KIM 

i i i. 

ligation). In an embodiment of interest, the KIM antagonist or anti-KIM antibody can be 
10 used therapeutically to inhibit or block growth of tumors which depend on KIM protein for 
growth. 

Other methods of the invention include killing KIM ligand-expressing tumor cells, 

*( * 

or inhibiting their growth, by contacting the cells with a fusion protein of a KIM and a 
toxin or radionuclide, or an anti-KIM ligand antibody conjugated to a toxin or radionuclide. 
15 The cell can be within a subject, and the protein or the conjugated antibody is administered 
to the subject. 

Also encompassed within the invention is a method for targeting a toxin or 
radionuclide to a cell expressing a KIM, comprising contacting the cell with a fusion 
protein comprising a KIM ligand and a toxin or radionuclide, or an anti-KIM antibody 
20 conjugated to a toxin or radionuclide. Another embodiment includes the method of 

suppressing growth of a tumor cell which expresses KIM, comprising contacting the cell 
with a fusion protein of KIM ligand and a toxin or radionuclide or with an anti-KIM 
antibody conjugated to a toxin or radionuclide; the cell may be within a subject, and the 
protein administered to the subject. 

25 The term "subject" as used herein is taken to mean any mammal to which KIM can 

be administered. Subjects specifically intended for treatment with the method of the 
invention include humans, as well as nonhuman primates, sheep, horses, cattle, goats, pigs, 
dogs, cats, rabbits, guinea pigs, hamsters, gerbils, rats and mice. 
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Use of Compounds of the Invention in Gene Therapy 

The KIM genes of the invention are introduced into damaged tissue, or into tissue 
where stimulated growth is desirable. Such gene therapy stimulates production of KIM 
protein by the transfected or transformed cells, promoting cell growth and/or survival of 
5 cells that express the KIM protein. 

In a specific embodiment of a gene therapy method, a gene coding for a KIM 
protein is be introduced into a renal target tissue. The KIM protein is expected to be stably 
expressed and stimulate tissue growth, division, or differentiation, or to potentiate cell 
survival. Furthermore, a KIM gene can be introduced into a target cell using a variety of 

10 well-known methods that use either viral or non-viral based strategies. 

i 

Once introduced into a target cell, sequences of interest can be identified by 
conventional methods such as nucleic acid hybridization using probes comprising 
sequences that are homologous/complementary to the inserted gene sequences of the 
vector. In another approach, the sequence(s) may be identified by the presence or absence 
15 of a "marker" gene function (e.g, thymidine kinase activity, antibiotic resistance, and the 
like) caused by introduction of the expression vector into the target ceil. 

Formulation 

In general, compounds of the invention are suspended, dissolved or dispersed in a 
pharmaceutical^ acceptable carrier or excipient. The resulting therapeutic composition 

20 does not adversely affect the subject's homeostasis, particularly electrolyte balance. Thus, 
an exemplary carrier comprises normal physiologic saline (0.15M NaCl, pH 7.0 to 7.4). 
Other acceptable carriers are well known in the art and are described, for example, in 
Remington's Pharmaceutical Sciences . Gennaro, ed., Mack Publishing Co., 1990. 
Acceptable carriers can include biocompatible, inert or bioabsorbable salts, buffering 

25 agents, oligo- or polysaccharides, polymers, viscosity-improving agents, preservatives, and 
the like. In some embodiments, the term "carrier" encompasses liposomes and the HIV-1 
tat protein (See Chen et al., Anal. Biochem. 227: 168-175, 1995) as well as any plasmid 
and viral expression vectors. 
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Any KIM compound of this invention can be used in the form of a pharmaceutically 

acceptable salt. Suitable acids and bases which are capable of forming salts with the 

i 

polypeptides, nucleic acids and vectors of the present invention are well known to those of 
skill in the art, and include inorganic and organic acids and bases. 

KIM compounds of the invention are dispersed in the carrier to concentrations 

H 

sufficient to deliver to the subject a therapeutically effective amount of the compound, 
which is an amount sufficient to produce a detectable, preferably medically beneficial 
effect in the subject. Medically beneficial effects would include preventing, delaying or 
attenuating deterioration of, or detectably improving/the subject's medical condition. It is 

10 expected that the concentration or amount of a KIM compound that will produce a 

1 1 i 

medically beneficial effect will vary considerably with the circumstances in which the 
invention is practiced. An effective amount can be determined by an ordinarily skilled 
physician or other practitioner through no more than routine experimentation. As an 
example, an indication of the status of renal injury or renal function can be monitored with 

15 one or more routine laboratory tests which measure the concentrations of relevant 

substances in blood or urine, other urine characteristics, or the rate of clearance of various 
substances from the blood into the urine. The parameters measured by these tests, either 
individually or in combination, can be used by a physician to assess renal function or 
damage. Examples of such parameters include the blood concentration of urea, creatinine 

20 or protein; the urine concentration of protein or of various blood cells such as erythrocytes 
or leucocytes; urine specific gravity; amount of urine; the clearance rates of inulin, 
creatinine, urea or p-aminohippuric acid; and the presence of hypertension or edema. 

In some embodiments, a KIM compound is formulated in a liposome delivery 
system, including without limitation any of a variety of unilamellar vesicles, multilamellar 
25 vesicles, or stable plurilamellar vesicles, all of which can prepared and administered 
according to methods well known to those of skill in the art, for example in accordance 
with the teachings of United States Patent 5,169,637, 4,762,915, 5,000,958 or 5,185,154. 
In addition, it may be desirable to express the novel polypeptides of this invention, as well 
as other selected polypeptides, as lipoproteins, in order to enhance their binding to 
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liposomes. As an example, treatment of human acute renal failure with liposome- 
encapsulated KIM protein may be performed |o vivo by introducing a KIM protein into 
cells in need of such treatment using liposomes. The liposomes can be delivered via 
catheter to the renal artery. The recombinant KIM protein is purified, for example, from 
5 CHO cells by immunoaffinity chromatography or any other convenient method, then mixed 
^ with liposomes and incorporated into them at high efficiency. The encapsulated protein , 
may be tested in vitro for any effect on stimulating tell growth. 

Routes of Administration 

i f 

The compounds of the invention may be administered in any manner which is 
10 medically acceptable. Depending on the specific circumstances, local or systemic 

administration may be desirable. Preferably, the compound is administered via a parenteral 
route such as by an intravascular, intravenous, intraarterial, subcutaneous, intramuscular, 
intratumor, intraorbital, intraventricular, intraperitoneal, subcapsular, intracranial, 
intraspinal, or intranasal injection, infusion or inhalation. The compound also may be 
15 administered by implantation of an infusion pump, or a biocompatible or bioerodable 
sustained release implant, or by installation of a catheter (e.g., in a renal artery), into the 
subject. Alternatively, certain compounds of the invention, or formulations thereof, may 
be appropriate for oral or enteral administration. Still other compounds of the invention 
will be suitable for topical administration. 

20 Treatment Regimes 

Determining appropriate dosage and frequency of treatment with any particular 
KIM compound to be administered to an individual is within the skills and clinical 
judgement of ordinary practitioners. The general dosage and treatment schedule is 
established by preclinical and clinical trials, which involve extensive but routine studies to 

25 determine the optimal administration parameters of the compound. Even after such 
recommendations are made, the practitioner will often vary these dosages for different 
individuals based on a variety of considerations, such as the individual's age, medical 
status, weight, sex, and concurrent treatment with other pharmaceuticals. Determining the 
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optimal dosage and administration regime for each KIM compound is a routine matter for 
those of skill in the pharmaceutical and medical arts. 

Equivalents 

The invention may be embodied in other specific forms without departing from the 
spirit or essential characteristics thereof. The foregoing embodiments are therefore to be 
considered in all respects illustrative of, rather than limiting on, the invention disclosed 
herein. Scope of the invention thus is indicated by the appended claims rather than by the 
foregoing description, and all changes which come within the meaning and range of 
equivalency of the claims are intended to be embraced therein. 

i 

! 

I 

i 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Sanicola-Nadel , Michele 

Hession, Catherine A 
Wei , Henry 
Cate, Richard L 

(ii) TITLE OF INVENTION: MODULATORS OF TISSUE REGENERATION 

(iii) NUMBER OF SEQUENCES: 112 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Patent Administrator, Biogen, Inc. 

(B) STREET: 14 Cambridge Center 
<C) CITY: Cambridge 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02142 

(v) COMPUTER READABLE FORM: 

i(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS i 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(Vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/047,490 

(B) FILING DATE: 23-MAY-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/047,491 

(B) FILING DATE : 23-MAY-1997 

(Viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Fenton, Gillian M 

(B) REGISTRATION NUMBER: 36,508 

(C) REFERENCE/ DOCKET NUMBER: A028PCT 

<ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 679-2810 

(B) TELEFAX: (617) 679-2838 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2058 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
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(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW010 

(ix) FEATURE: 1 

(A) NAME /KEY: CDS 

(B) LOCATION: 8. ,646 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: complement (1618.. 1885) 
(D) OTHER INFORMATION: /label* SAC_23807 

t 

t 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GTCGACA GCT CGG ATC CCC CCA TCA CCT ACC TCA CAG ACT GCT CAG ACA 49 
Ala Arg life Pro Pro Ser Pro Thr Ser Gin Thr Ala Gin Thr 

1 1 5 10 

GAA GGT AGT AGG ATC ACA TCC AAA ATG CCT GTG GCG TCT ACC CAG GGG 97 
Glu Gly Ser Arg lie Thr Ser Lys Met Pro Val Ala Ser Thr Gin Gly 
15 20 25 30 

TCC TCT CCC CCT ACC CCG GCT CTG TGG GCA ACC TCC CCC AGG GCT GCC 145 
Ser Ser Pro Pro Thr Pro Ala Leu Trp Al'a Thr Ser Pro Arg Ala Ala 

35 40 1 45 

i , 

ACC CTA CCT CCC AAG TCC TCC TCC ACC GGC ACA TCC TGT GAA CCA AAA 193 
Thr Leu Pro Pro Lys Ser Ser Ser Thr Gly Thr Ser Cys Glu Pro Lys 

50 55 60 

ATG GTC ATC AAC ACG GTC CCA CAG CTC CAC TCG GAG AAG ACA GTG TAT 241 
Met Val lie Asn Thr Val Pro Gin Leu His Ser Glu Lys Thr Val Tyr 
65 1 70 75 

CTC AAG TCC AGT GAC AAC CGC CTG CTC ATG TCT CTC CTC CTC TTC CTC 289 
Leu Lys Ser Ser Asp Asn Arg Leu Leu Met Ser Leu Leu Leu Phe Leu 
80 85 90 

TTT GTC CTC TTC CTC TGC CTC TTT TCC TAC AAC TGC TAC AAG GGC TAC 337 
Phe Val Leu Phe Leu Cys Leu Phe Ser Tyr Asn Cys Tyr Lys Gly Tyr 
95 100 105 110 

CTG CCC GGA CAG TGC TTA AAG TTC CGC TCA GCC CTG CTG CTC GCA AAG 385 
Leu Pro Gly Gin Cys Leu Lys Phe Arg Ser Ala Leu Leu Leu Ala Lys 

115 120 125 

AAA AAA CCT AAG TCA GAG TTC TCT GAC CTG GAG CAG AGT GTG AAG GAG 433 
Lys Lys Pro Lys Ser Glu Phe Ser Asp Leu Glu Gin Ser Val Lys Glu 

130 135 140 

ACG CTG GTA GAA CCT GGG AGC TTC TCG CAG CAG AAC GGC GAC CAG CCC 481 
Thr Leu Val Glu Pro Gly Ser Phe Ser Gin Gin Asn Gly Asp Gin Pro 
145 150 155 

AAG CCA GCC TTG GAT ACC GGC TAT GAA ACC GAG CAG GAC ACT ATC ACC 529 
Lys Pro Ala Leu Asp Thr Gly Tyr Glu Thr Glu Gin Asp Thr lie Thr 
160 165 170 

AGC AAG GTC CCC ACC GAT CGA GAG GAC TCG CAA CGT ATC GAC GAG CTC 577 
Ser Lys Val Pro Thr Asp Arg Glu Asp Ser Gin Arg lie Asp Glu Leu 
175 180 185 190 
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TCC GCC AGG GAC AAA CCG TTT GAT GTC AAG TGT GAA CTC AAG TTT GCA 625 
Ser Ala Arg Asp Lys Pro Phe Asp Val Lys Cys Glu Leu Lys Phe Ala 

195 , 200 205 

GAC TCG GAT GCC GAC GGG GAC TGAGGCCAGT GTGTCCCAGC CCGTGCCCCT 676 
Asp Ser Asp Ala Asp Gly Asp 

210 

if CCGTCTTCGT GGAGAGTGTT GTGTTGAACC TATTCAGTAG CCGAGTCTTG TCACTGTGCA 736 

'^CCAGCCTCAG TCTTTTGTCC CTTTCTCTCT TGGGTTGAGC CTGTGACTTG TCCCCTTTGT 796 

CCTTTTGGGA AACAAGTATC TATTCAAGCC TCAAGTCCCG GCAGTTGTTG GAGCGCTTAC 856 

TACACACCTG AGCCCTTTGT GTCCTGGGGG AGAGATGGCC ACCTCCGTGG GCTGAGAAGA 916 

ACCACCCCTT CCTCTTPCGC TCCTCGAAGC AGCCACTGAG AGATAATTTA ATTCCAGATT 976 

GGAAATGACC TTTTTGGTTT ATCAGATTGG TAACTTAATC TCCTGCCATC CGGGTGGCAC 1036 

t 

GGACGTTTTT CTTTCACTTC GTTATTTTTT TTAGGATTTG ' CGCTCCTACT GTGTTGATGT 1096 

CTTAGGTCAT TTTTTTTTTT TTAAGTTACC AGAGGAGATG TTTTGATATT CATGAGAAGA 1156 

AGAACATTTT CTAGATTTTT TTTTTTTTTG GTTACA^TATT GAG^TAAAAT ATGCCTCTGT 1216 

TGCTTAAGAT TCTCAGGgAt AGACGTGATT TTTGCTAATT TCTTTCCTGC TGTTCGGAAC 1276 

GTAGGCCTAA AACTGTCTCT TGAGCTCACA CACTCACCCT TTCCTTTTGG TGGGTTTTTT 1336 

TTTTTTTTTG TTTTGTTTTA TTAATCTTAC TCATTTCGAA GGATTTTCTT TATGAGCTTT 1396 

TGTTTTGTTT TTGTTTTGTT TTTTTGTTTT TCTCTTTTTC CTGCCGTACA TCATCTACGA 1456 

GGTGCGTTTT GAGTGAGGGC AGATGGCCCA GTGGCTTCGG GTGGCCCAGT GGCTGTGGGT 1516 

GGCAGTTGAG CTGGTCCTGT GAGGGGAGGA GGGCTTAGAC GCCACGGCCC TGCTGCTTCT 1576 

CGGCACCTCC TGTCCCGATG GGTGGCGTCT TCATGAGGAT TACATCCTGT CTCTGCTGGC 163 6 

TTCCATCTCG CACCTCTGCC CTTGGACTTC CACCTTGACT GTCCACAAAA GACAGAAATG 1696 

GGCTGGCTAG TTGGGCTCCC GGCCTCGGAC GGTGGCCGCA ACGTCCCGTG TGGGCGGCTG 1756 

CCGCTGCAGC CTGACTCCTG CCGGTGTCTT TCAGGATGTG AACGGGTGGT ACGAATCTTG 1816 

ACATTTGTTT CTCACTTCCT GTGTATGAAA CGACACTCAT TCCATGTAGA GGGTGACGGA 1876 

CTCTGGATCC CCCCCCTTGT CGTGTAGACA CTCATCTTCA GCGTGACCTG GTCCTGCCAT 1936 

TCGGTGTAAA CATTTGTGTT TATAAGATTT ACTTTGTTTT TATTTTTCTA CTTGGAACTG 1996 

TACATATTTG AAAAAGTACC CAAATAAACC AGAAGCTTTA TCGTTGAAAA AAAAAAGTCG 2056 

AC 2058 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Ala Arg He Pro Pro Ser Pro Thr Ser Gin Thr Ala Gin Thr Glu Gly 
15 10 15 

Ser Arg He Thr Ser Lys Met Pro Val Ala Ser Thr Gin Gly Ser Ser 

20 25 30 

Pro Pro Thr Pro Ala Leu Trp Ala Thr Ser Pro Arg Ala Ala Thr Leu 
35 40 45 

Pro Pro Lys Ser Ser Ser Thr Gly Thr Ser Cys Glu' Pro Lys Met Val 

50 55 60,' 

He Asn Thr Val Pro Gin Leu His Ser Glu Lys Thr Val Tyr Leu Lys 
65 70 75 80 

Ser Ser Asp Asn Arg Leu Leu Met Ser Leu Leu Leu Phe Leu Phe Val 

85 90 ' 95 

Leu Phe Leu Cys Leu Phe Ser Tyr Asn Cys Tyr Lys Gly Tyr Leu Pro 

100 105 , 110 

Gly Gin Cys Leu Lys Phe Arg Ser Ala Leu Leu Leu Ala Lys Lys Lys 
115 120 ( 1 125 

Pro Lys Ser Glu Phe Ser Asp Leu Glu Gin Ser Val Lys Glu Thr Leu 
130 135 140 

Val Glu Pro Gly Ser Phe Ser Gin Gin Asn Gly Asp Gin Pro Lys Pro 
145 150 155 160 

Ala Leu Asp Thr Gly Tyr Glu Thr Glu Gin Asp Thr He Thr Ser Lys 

165 170 175 

Val Pro Thr Asp Ara Glu Asp Ser Gin Arc lie Asp Gin Lev. Ser Ale 

180 18b 190 

Arg Asp Lys Pro Phe Asp Val Lys Cys Glu Leu Lys Phe Ala Asp Ser 
195 200 205 

Asp Ala Asp Gly Asp 
210 

( 2 ) INFORMATION FOR SEQ ID NO: 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1727 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW011 
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(xi) SEQUENCE DESCRIPTION: SEQ,ID NO: 3: 
GTCGACATGA TTGCTAGTGG TTCTGCTGGC TGTGAGGGCT CCCTGTCTGT GAGATCTTGT 
TCGGGCTTTC CACATCCAGG ATCCATCCAT TCACCCCTCA CCCACTCACC CTGTGGAGCA 



11 TAAAACCTTT GCAGTCAGCC AGCATGGCAG AATGTCTATG CCTTGCTTCA GTTTCCAGGG 
'Vaactaagca TGTTTTAATC CCACTGCCCC ACCCACTATT GCTCATTCTG TCTAGTCTTC 
TGCTACCCAG CTTGTGTGGC TCCGTTCCTG TCACGCCTTC AGACGTCACT GTTCTGTGAC 
AGCGGTCTCT TTCCCCTCTG CTCCACCCAT TCTGAGCTCT GGGATAGTCC TAAGCCCTGC 
CCAGAGTAGG TGGACTGGTC GTTCCATGTC CGTGCTTGTG GGAGTTGTGG GGGTCTGTGG 
CATCTGCTAA GGAGGTTTAT CCTCACGTCA CQGAATCCGG CAAGATGCGC CATGGTCTAC 
ACTGCTTTCT CGAGCAGCGG CGAGAAAGGC 



TAACAAACTG TCTGAGAAAG GAAGTGGCCC 
AGGAGACACT GCCATGTTTC CGTCATTCTG 
TGAAGAATGA GGATACACCA TTCGACCTTC 
GTTGTTGTGA GTCCAGCTCA TCAGCATGGC 



CAGGATAACA ' CACGCCTCTC TGCCCTGAGA 

i 

GTGTCTGGAA CATTGGCCCC AGTAAAGCAG 
TTTCACtGCC TTGCTCTGGG GCTCAGGGGT 
AGGCCTTCCT CAGACCCTCT TACAAAGCCT 

! 

CGCCTTGTTC ACAGTTTAGC ACCAGTGTCT 



GCAACCGTGT CTGAAACGTA GCAGATGTCC AGTACTTTTT GTAAATGAAA AAATGAACAG 
AGTGAAGAGA AAGAGTCACC ATGAAATTTA CTAGTCCAGT GAGCTGACCG TAGGGGCAGT 
ATAAACATCC CCCAGCTCTT AATGTTCTGT CCTCAAGCTG TCCCCTCCCA CCTCTCCGCT 
GGCAGGAGAG TGCTCTGTAT ATGTGGACAG TCGTTGAATG GCAGGTCCAG GATGTCCTCT 
GCAGGTGGAC AAAAGAGAAT TCTGCTTCCC GGAAGTCTCC TTTGCTGGTG ATCTCCATGT 
CCATCCTCAC TGAGCACTAG AATTAAAACC ACTCAATAAT GTGATATGTG TGGTCACCTT 
CAGTTCAGTA GCAGTATTTC CTGGTCATTA ATATTCTGCA TTTTATTATG GTAAGATAGT 
ATAAGAAAAT GGTGGGCTGG TATATAGCCA GCCATGAGAC CACTCCTTAC GTCAAGGACT 
GGCAGAGTCT GAGGGACCTG AGTCTGGGTC CTCCTCCTGT CCCCTCCTCC TGTAGCTATG 
GTGGCCTTCA GTATTCTGAG ACTGAAGGCT ACTACTTCAG GACACCTACT TGATCTTAGT 
ACTCTCAGAA CAGCTCTCCT CAGTTCTAGC TGTGATCGCG GGCTGCGTTT GGAACGTTAA 
CGTACTAATG GGTTTAAGAC TTTCAGAAAG TGATTTTAAA ACAAAATGAC TGGAAGTTTT 
GTCTGATAAA GCAAGTAGAA TGGAATGTCT CAGACTGTCT CGTCACATTT TCATGTTTCA 
TACGAAAAGA AAAGATTGAT AACCTATCTT TAGAAATAGT AATGTTAGTT CTTTCCTATT 
TTGTTTAAAC CTTGAAGAAG GTTTTATTTA ACTCGTGAAC TTGTTCCTTT GTTATGACAT 
GATTGTTCCA GATTTGAGGG ACAAAATAAA ATTAACCTTT CTAAAAA 



60 
120 
180 
240 
300 
360 
420, 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1727 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1060 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW012 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 39.. 671 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GTCGACGCAT, GGGTGCAGGT GACAAATGAG CTGAAGAA ATG ATC CCC AGA GGA 53 

Met lie Pro Arg Gly 

1 5 

ATA GGA TTG TGG CTG CCT TCA GCT CTG CTC CTC TCT CGG GTC CCA GGC 101 
He Gly Leu Trp Leu Pro Ser Ala Leu Leu Leu Ser Arg Val Pro Gly 

10 15 ' 20 

TGT TTC CCT CTG CGT GGC CCC AGC AGT GTG ACA GGC ACT GTG GGG GAG 149 
Cys Phe Pro Leu Arg Gly Pro Ser Ser Val Thr Gly Thr Val Gly Glu 

25 30 35 

TCC CTG AGC GTG ACT TGT CAG TAT GAG GAG AGA TTC AAG ATG AAT AAG 197 
Ser Leu Ser Val Thr Cys Gin Tyr Glu Glu Arg Phe Lys Met Asn Lys 
40 45 50 

AAA TAC TGG TGC AGA GGG TCA CAT GTT CTA CTT TGC AAA GAT ATT GTC 24 5 

Lys Ty* Cys Arg Gly Ser His Val Leu Leu Cys Lys Asp He Val 

55 60 65 

AAG ACC GGA GGC TCA GAA GAA GCT AGA AAT GGC CGA GTG TCC ATC AGG 293 
Lys Thr Gly Gly Ser Glu Glu Ala Arg Asn Gly Arg Val Ser He Arg 
70 75 80 85 

GAT GAT CCA GAC AAC CTG ACC TTC ACA GTG ACC TTG GAG AGC CTC ATC 341 
Asp Asp Pro Asp Asn Leu Thr Phe Thr Val Thr Leu Glu Ser Leu He 

90 95 100 

CTG GAG GAT GCA GGC ACC TAC ATG TGT GGG GTG GAT ATA CCA TTT ACT 389 
Leu Glu Asp Ala Gly Thr Tyr Met Cys Gly Val Asp He Pro Phe Thr 

105 110 115 

AAT CAC CCC TTG GGG ATT GAT GAG TTC TTC AAG GTT GAA TTG TCT GTG 437 
Asn His Pro Leu Gly He Asp Glu Phe Phe Lys Val Glu Leu Ser Val 
120 125 130 

GTC CCA GGT TCA AGC CTT TGG AGC AGT ACA ACA GTC CCA GAG ACC ATC 485 
Val Pro Gly Ser Ser Leu Trp Ser Ser Thr Thr Val Pro Glu Thr He 
135 140 145 



-40- 



SUBSTITUTE SHEET (RULE 26) 



• WO 98/53071 PCT/US98/10547 



AGA TCC TCA CTG GTT CAT ACT CAG CCC AGT GTG ACC ACA GAA GAC ACA 533 
Arg Ser Ser Leu Val His Thr Gin Pro Ser Val Thr Thr Glu Asp Thr 
150 155 160 165 

ATT CCT GCT CCT AGT GCA CGG CCT CGG TCT CTT CTG GGC AGC CTC TAC 581 
He Pro Ala Pro Ser Ala Arg Pro Arg Ser Leu Leu Gly Ser Leu Tyr 

170 175 180 

CTC TGC ATC CTG GTG TTT CTG GAG TTG CCC CTG TTC CTG TGC ATG CTC 629 
Leu Cys He Leu Val Phe Leu Glu Leu Pro Leu Phe Leu Cys Met Leu 

185 190 195 

TGT GCC GTC CTG TGG GTG AAC AGG CCT CAG AGG TGC AGT GGG 671 
Cys Ala Val Leu Trp Val Asn Arg Pro Gin Arg Cys Ser Gly 
200 205 ( 210 

TGACATAGCA TTGGGCCCTG TCATGAGTAC CAGTGAAGTC TGTTpATATG GAGGCCCTGT 731 

CCCTGGATGC ATCACCTCTG GTGGCCAAGG ACAACCAATA AAGCTGTGTT CTTGAGAATG 791 

CTCTGAGACT TTTAGAAGAT TCTCTGCACG TGTCAAACAC ATAAAAGAAT GTGCCCAATA 851 

AATACCAGCT GGGGAATTGG CTCAGCAGGT AACAACAATG GCTGCTTTTC CAGAGGTCCT 911 

GAGTTCAATT CCAATAACTA CATCGGTGGC TCACAACCAT CTCTAATGAG TTCAGATGCC 971 

i 

CTCTTCTGCT GTGCAGGTGC ACATTCACAT AGAATATTCA TACATTGAGT AAATTTATTA 1031 

AAAGTAATTG CAGAATTAAA ACACAAAAA , 1Q 60 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met He Pro Arg Gly He Gly Leu Trp Leu Pro Ser Ala Leu Leu Leu 
15 10 15 

Ser Arg Val Pro Gly Cys Phe Pro Leu Arg Gly Pro Ser Ser Val Thr 

20 25 30 

Gly Thr Val Gly Glu Ser Leu Ser Val Thr Cys Gin Tyr Glu Glu Arg 
35 40 45 

Phe Lys Met Asn Lys Lys Tyr Trp Cys Arg Gly Ser His Val Leu Leu 
50 55 60 

Cys Lys Asp He Val Lys Thr Gly Gly Ser Glu Glu Ala Arg Asn Gly 
65 70 75 80 

Arg Val Ser He Arg Asp Asp Pro Asp Asn Leu Thr Phe Thr Val Thr 

85 90 95 

Leu Glu Ser Leu He Leu Glu Asp Ala Gly Thr Tyr Met Cys Gly Val 

100 105 110 
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Asp He Pro Phe Thr Asn His Pro Leu Gly He Asp Glu Phe Phe Lys 
115 120 125 

Val Glu Leu Ser Val Val Pro Gly Ser Ser Leu Trp Ser Ser Thr Thr 
130 135 140 

Val Pro Glu Thr He Arg Ser Ser Leu Val His Thr Gin Pro Ser Val 
145 150 155 160 

Thr Thr Glu Asp Thr He Pro Ala Pro Ser Ala Arg Pro Arg Ser Leu 

165 170 175 

Leu Gly Ser Leu Tyr Leu Cys He Leu Val Phe Leu Glu Leu Pro Leu 

180 185 i 190 

Phe Leu Cys Met Leu Cys Ala Val Leu Trp Val Asn Arg Pro Gin Arg 
195 200 205 

Cys Ser Gly 
210 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 3385 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOliOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW013 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GTCGACTTTT TTTAACAAAG ATGATTTAAT ATTTAATATC ATACCCAGAA AAATCAAACA 60 

ACTTCCAGAA TCAGACAGGT AAGATTCCTT AGTATGTTTT TACCAAAATT TTATAAGATT 120 

ATCTTAAATT TGAAGAATTG TGCATAAATA ACACAGGATA CCAAAGGAAA CTGAGCTGTA 180 

TTTTTTGCTT TGATGGGAAA AAAAAAAAAA AAAACCTGAA ACCCAGAGAC ACTAACCATT 240 

TTGTGCACTG TATAACTCAT TGTGGAAGCT GAAACTTAGA ACTCATCTTT GGAAATATAT 300 

AATTGATGAA CCCACCTTTC CATTCAGCTT TTAGGGTGAT GTAAAACTGT ATGAATCCAC 360 

TTACAAATAA ACTTAAAAGA CCTTTTCATG TTACCCTATC TCTTATCTTC TCAATGTATT 420 

TCTATATGCT TACCTTTAGT AAGAGTACCA TGGAGTACTT TGATAGATGG AATGACAGAG 480 

AAATGGATGT TTAAAGATCT TTAGGATGAA AAGGATAATT TAAGAAGCAT ATTTAAGAAG 540 

CATGCTGATA TTGAAAAGGT AGTATAGGCA AGTACAGTGC ACTTCCACAG ACTACTCATA 600 

CCTTCTTGTC CATCAGAAAG TCTGAGTGGG TTAGTTACAG TAAGGTCATG TTGATCTGGG 660 
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TCTATGCACC 


TGTGCTGAGG 


TCAGAGGACA 


ACTTTCAGGA 


GTCTGTTCTG 


CCCTTCCGTG 


720 


TGGGTCCTAA 


GTATGCAACT 


CAAAGGTTGT 


CTGTCAGGTT 


GATTCTGCCC 


AGTCATGAAA 


780 


CCTTTGAGAA 


GAGGAAAAGG 


TGTAGGAAGA 


ACAGGAACAG 


GATCATAGAA 


GTGACTAAGC 


840 


CAGAATCTGT 


CATATTTGAG 


GAAACATCAA 


GAGGTGCCTA 


TAATAGGGTT 


CCGAGAATGA 


900 


AAAAAAAAAC 


CAGATAACTT 


TACTGTCTAT 


TCCAGCCAGT 


TTGACTTAGA 


GGCACTGGCA 


960 


AAGTAGGTAC 


AGACAGCTGA 


GGTAGAAAAT 


GTGGAACATT 

i 


ACTGTGGCAA 


CTGCTGCATC 


1020 


TTCGGGTGTG 


GAAGTCACAC 


CAATTCCACT 


TTTGGTTGGC 

i 


ACTTCCCATC 


TTCTAAGAGA 


1080 


GAACCTTTAA 


TTGAGCGGCC 


GTGAAGAACT 


TGATAAATCT 


ACGCTGCTGT 


GTGTCTCTCA 


1140 


CCACCACCCT 


ACCTACATAG 


ACACCCCCAT 


TTAGTTGTTT 


CCTGAGAATC 


AGAACAGAGC 


1200 


ACTTACACGA 


GGTACGGGAA 


GCTGTGATGC 


CTAGTGTAAG 


AAGTTAGAAG 


ATGACTGAAC 


1260 


ACAGGAAGAG 


AGCGGGCTTG 


CAGTGGGATC 


TAAATCAAAT 


CAGCGAGGAT 


CACTGGAAAC 


1320 


TGAGCTGAAA 


CTCAACTAAG 


ATTTAGCTTG 


CAAACAATAG 


AGATGTTTCA 


TTTTAAAACC 


1380 


ACAGCAGATT 


AACAGTATAG 


CTGGGAAACA 


ACCAAGTGTT 


CTTTTCAAAT 


TTCAGTTCCA 


1440 


CTATATCTGA 


GCATGGTfAA 


AATAGGGGTG 


AGTCCTATTT 


CATGGCTTTT 


ATCTATTAAA 


1500 


TATTTGTAAA 


CGTGCTCAAG 


TTCCCATTTC 


TTTAACCCTA 


CAAAATTCAG 


GCTTCTCACA 


1560 


CTGGGGCTGT 


AGTTCTTTAA 


TAGTTTTCTA 


ATTCCTGAGC 


CATCTTTAGA 


CTTCTGAAAA 


1620 


CCTCAACTTC 


AGTCTCTTCT 


CACATAGCCA 


ACTATGAGAA 


CCTCCAAAGC 


AGAATGCAGT 


1680 


TTTCCTTGTG 


CCTTTAGCAA 


CAAGGCACCT 


TTACAAAACT 


GGTTCTCAGT 


GAATGAAATG 


1740 


AAAGCATGAT 


CTTACAGTGT 


GCGACACTTC 


AGTAGTGATG 


ACTTAGAATA 


GTTTTAAAAT 


1800 


AAATAAAATT 


ACAGAACCTA 


GGGGAGGTGA 


AAAATGGGAA 


ACATGAAACA 


GTAGACCATA 


1860 


CTGCAGCTCA 


CAAAACAAGC 


AGTGAAAACA 


TATAAACTAC 


CTATAATAGC 


CTCATTATAA 


1920 


CATGCAAACA 


CACTAACTCT 


GTTACATACA 


GATTGGATGT 


ATATCTGAAG 


TTGTTTATAG 


1980 


ATACAATGGG 


TTTAAGTTGG 


TAATTAATAA 


ATAGTACTAC 


CTTAGAATGT 


GTTGTGCTTT 


2040 


TAAGTATTAT 


GAAAGCTCAA 


GAGCCAGGCA 


TCTTCCTCTT 


CAGAGCTGTT 


TCCTGTTACA 


2100 


TAATTACTGG 


CAAAGTTCTA 


AACAGTTCTT 


TACTAAATCA 


CAAAATGCAT 


ACAGCTAGTG 


2160 


CCATTGTCTC 


AAGTCTTTTT 


AGAATAGAAA 


TGTCGATGTG 


AAGCTATAAT 


TTTATTTTGG 


2220 


TTTTATGATC 


CTGCAAGGGT 


TAAGTTTGTC 


TGATTCTTCC 


CAGGGTACTG 


ATAAGTTCTC 


2280 


TAACATACCG 


GAGTAAGAGA 


ACTGAAACAG 


GTTATTTCCC 


TCATCCTCAG 


GAATCTGCTT 


2340 


GAATATGGAT 


CAGCATATAT 


TGGACTTAAT 


GCTGCTTTTG 


GGGGCCTAAT 


AGCAAACAGT 


2400 


CTATTTCGGC 


GAATTTTGAA 


CGTGACACAA 


GCTCGAGTAG 


CATCTAGCTT 


GCCAATGGCT 


2460 


GTGATCCCAT 


TTTTGACAGC 


AAACCTGTCC 


TACCATAGTT 


TTGTAAGTTT 


ACCTTTGAGT 


2520 


ACAGGTATGT 


TTTATTTCAT 


TATCATCCAA 


GCCTGCATTT 


ATGCATACTA 


TATACACTTA 


2580 
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TATATACTTC ACTCAAAACT GAAGGATTAG AACCGAGTAG GTGTGAGCTT TAATGAAGTG 2640 

TTTAGGGAAG ACCAAATCTA TTCAACAACT GATTCAGCTT AGGTCTGCCA ACCAACCAGT 2700 

GAGTACATAT CCACAGCCGT CAGTCACACA CTTCCCTGAC CTGAAGCAGC ATTCCCAGTG 2760 

CTGCCTTTGT TTCTTGTTAA TTTGCACATT ATATGCAAGT TTTCTCTTTT TAAAAAGATG 2820 

TTTTTATTTA GGTATGTGTT TTGTCTGACT GTACATTTGC ACACCAGGTA GTGGTACTGG 2880 

ACCCCATATA CTAGTTACAG ATAAGGGTGA GCTGCCACGT GGGTGCTGGT AATTGAACTC 2940 

AGCACTTAGT GCCATTGCTC TTAATTGAGA TCTCTCTCCA GCCCTGTATG AAAGTTCTTT 3000 

AAAGTATAAG TATGTTGTGG GAATACAACT CTGTAATCTC TGAAGTACTA TGTGCATTGC 3060 

AGCATTTAGC AAAATGAAAG ACATAGTGTC CCATGTGTCC TGAa!cATGTA TTGTTATGTG 3120 

TTCTATATGC ATGACCCTTC TTATTAAAAA CAGATTGGGG AAAGGGGTCA ATTTTGTGTT 3180 

TTCTCCTAAA TAATAATCCA AATTTCTGAG AATCCAGAGT CA^GATTTGT TTTCCTGTTT 3240 

ATCAGTATTT .CCTATCCAAG GCCTTAATAC ATGCTAAGCA TGTATTCACA CCAAGCTACA 3300 

GTCCATCCTC TATAATTATT CCTTTGACAT GATATGCACC ATTTTTGCTG TGTCACTGAA 3360 

AATAAAATGT GAACCTTATG TCGAC 3385 
(2) INFORMATION FOR SEQ ID NO: 7: 

i i 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1779 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

( i d j ) HYPOTHETICAL : NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW014 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 175. .804 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GTCGACCTGG AAAGAGCTGC AGTGTTCGCG CTTGGTAGCT GGTGGACCGG GCTCAGCACC 60 

TTCGGCTGGC TTTGTGTCCC AGAGGCTCAC CGGAAAAAGA CTTTCTCAGC CCTCGGACTC 120 

CAGAGAGAGA TACTTTTCGT GCTCCCCAAC CTTGACTCCT GCAGAAGCTG AGCG ATG 177 

Met 
1 

GCG TCC ATG GGG CTG CAG GTG CTG GGA ATC TCC TTG GCC GTC CTG GGC 225 
Ala Ser Met Gly Leu Gin Val Leu Gly lie Ser Leu Ala Val Leu Gly 

5 10 15 
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TOG TTA GGA GTC ATC CTG AGT TGT TCG CTC CCC ATG TGG CGG GTG ACC 
Trp Leu Gly Val lie Leu Ser Cys Ser Leu Pro Met Trp Arg Val Thr 
20 25 30 

GCC TTC ATC GGC AGC AAC ATC GTC ACG GCG CAG ACG AGC TGG GAA GGC 
Ala Phe lie Gly Ser Asn lie Val Thr Ala Gin Thr Ser Trp Glu Gly 
35 40 45 

CTG TGG ATG AAC TGC GTG GTG CAG AGC ACC GGC CAG ATG CAG TGC AAG 
Leu Trp Met Asn Cys Val Val Gin Ser Thr Gly Gin Met Gin Cys Lys 
50 55 60 65 

ATG TAC GAC TCG ATG CTG GCC CTG CCG CAG GAC CTG CAG GCC GCC CGA 
Met Tyr Asp Ser Met Leu Ala Leu Pro Gin Asp Leu Gin Ala Ala Arg 

70 75 1 80 



273 



321 



369 



417 



GCC CTG ATG GTC ATC AGC ATC ATC GTG GGT GCT CTG GGG ATG CTT CTC 
Ala Leu Met Val lie Ser lie lie Val Gly Ala Leu Gly Met Leu Leu 

85 90 95 

TCA GTC GTA GGG GGC AAG TGC ACC AAC TGC ATG GAp GAC GAG ACC GTC 
Ser Val Val Gly Gly Lys Cys Thr Asn Cys Met Glu Asp Glu Thr Val 
100 105 110 

AAG GCC AAG GTC ATG ATC ACA GCC GGA GCC GTG TTC ATC GTG GCA AGC 
Lys Ala Lys Val Met He Thr Ala Gly Ala Val Phe He Val Ala Ser 
115 120 125 

ATG CTG ATT ATG GTG CCT GTG TCC TGG ACG GCA CAC AAC GTC ATC CGC 
Met Leu He Met Val Pro Val Ser Trp Thr Ala His Asn Val He Arg 
130 135 140 145 

GAC TTC TAC AAC CCT CTG GTG GCT TCC GGG CAG AAA AGG GAG ATG GGG 
Asp Phe Tyr Asn Pro Leu Val Ala Ser Gly Gin Lys Arg Glu Met Gly 

150 155 160 

GCC TCG CTT TAC ATC GGC TGG GCG GCT TCT GGG CTG CTG CTC CTG GGA 
Ala Ser Leu Tyr He Gly Trp Ala Ala Ser Gly Leu Leu Leu Leu Gly 

165 170 375 

GGA GGC CTC CTC TGC TGC AAT TGC CCA CCT CGC CGC AAC GAA AAG CCC 
Gly Gly Leu Leu Cys Cys Asn Cys Pro Pro Arg Arg Asn Glu Lys Pro 
180 185 190 

TAC TCC GCC AAG TAC TCC GCC GCC CGC TCT GTT CCC GCC AGC AAC TAT 
Tyr Ser Ala Lys Tyr Ser Ala Ala Arg Ser Val Pro Ala Ser Asn Tyr 
195 200 205 

GTG TAAGGTGGGC CACTCTCTGT CCACATTGCC TTTATTTTCT TGGATTGAAC 

Val 

210 

TCATAACGGC CTGTGGCCCC TCACATTCTC CAGGACCTGA CCAGCTGTGG GCTACTGACT 
GCTTGCAAAC CCGGACTGTG CTAAGTTACT AGCGTGTAGC CCTTGGGGAC CCACCTGGCC 
CATCTGGACA CATCTCAAGG CTCCAGCGAG GATAGATGTA AAAATATTTC CTTGCTTGCA 
TCCAGATTGC TCATGGATAC GGGGCTGAAG GCAGAAGCAG CTGTCTGGGT ACGACAGTGG 
AGGGGGAGCT GGGTCCTGCT GGCCGGGATA GCTCAGCTGT GACTTTGGTC TCTGGAGTGG 



465 



513 



561 



609 



657 



705 



753 



801 



854 



914 
974 
1034 
1094 
1154 
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il 



ATGTCCTGGT 


CATGTTAGCA 


AACATTCACT 


GCCCTTTCTC AGTGCCCTCG 


CTCTCTCGCC 


1214 


TCCACGTTAC 


TCCCGCGCTA 


CTCTTGCCGT 


TTCTCGCCCG 


CGTTTCTGAG 


CACACCAGGT 


1274 


CCTGCCTGGA 


GTCTTGGTGT 


CGAGGATGAC 


i 

TGACTGAAGG 


pGCCTTTGAG 


AGCTGATGGG 


1334 


TTCTGCCATG 


GACTCCTCCC 


GGTGATTAGC 


AATGACTGGG 


GCCTTACCCA 


CCCACCTACC 


. 1394 


CTCGTAATGA 


AGTTCTGTGG 


AGTGGCTGGA 


CAGGTTTGAG 


GGAAGGGTGG 


AGGTGGTTTA 


1454 


AACTGGTTTG 


GGGAGTGCTA 


GGGCTGGGGA 


CCCAGAAGCA 


GCCCAGGGTG 


TCCCCACCCC 


1514 


TTTCCCATAC 


GGTCTTGCTA 


AATGTTCTGA 


i 

TCTCTGTATA 


CCCCCTCCCT 


CTTCAGAAGG 


1574 


ACCCTGGGTG 


GGCCCCTCTG 


AATTCCCTAC 


CCTTGTCCAT 


i 
I 

TTCAAGGACG 


CTGGCCAGTC 


1634 


TGTGGAAGGT 


ACGGGGGTCT 

1 i < 


GATGGCATTG 


CACCAGGGAG 


CCTCCTGGAC 


TCCCTTGCCT 


1694 


TCTCTGTGGT 


TTCTTG^TTT 


GTAATTTAAG 


GTCTGTTCAC 


AGCTGTAATT 


ATTATTTTCT 


1754 


ACAATAAATG 


GCACCTGCAG 


t TCGAC 




i 




i 

1779 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYt>E: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Ala Ser Met Gly Leu Gin Val Leu Gly He Ser Leu Ala Val Leu 
15 10 15 

Gly Trp Leu Gly Val He Leu Ser Cys Ser Leu Pro Met Trp Arg Val 

20 25 30 

Thr Ala Phe He Gly Ser Asn He Val Thr Ala Gin Thr Ser Trp Glu 
35 40 45 

Gly Leu Trp Met Asn Cys Val Val Gin Ser Thr Gly Gin Met Gin Cys 
50 55 60 

Lys Met Tyr Asp Ser Met Leu Ala Leu Pro Gin Asp Leu Gin Ala Ala 
65 70 75 80 

Arg Ala Leu Met Val He Ser He He Val Gly Ala Leu Gly Met Leu 

85 90 95 

Leu Ser Val Val Gly Gly Lys Cys Thr Asn Cys Met Glu Asp Glu Thr 

100 105 HO 

Val Lys Ala Lys Val Met He Thr Ala Gly Ala Val Phe He Val Ala 
115 120 125 

Ser Met Leu He Met Val Pro Val Ser Trp Thr Ala His Asn Val He 
130 135 140 

Arg Asp Phe Tyr Asn Pro Leu Val Ala Ser Gly Gin Lys Arg Glu Met 
145 150 155 160 
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Gly Ala Ser Leu. Tyr lie Gly Trp Ala Ala Ser Gly Leu Leu Leu Leu 

165 170 175 

Gly Gly Gly Leu Leu Cys Cys Asn Cys Pro Pro Arg Arg Asn Glu Lys 

180 185 190 

Pro Tyr Ser Ala Lys Tyr Ser Ala Ala Arg Ser Val Pro Ala Ser Asn 
195 200 205 

Tyr Val 
210 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: ' 

(A) LENGTH : 2537 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA I 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
<B) CLONE: HW015 

i 

(ix) FEATURE: 

(A) NAME /KEY: CDS • 

(B) LOCATION: 11.. 2176 

(D) OTHER INFORMATION: /product= "predicted polypeptide 11 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GTCGACCAAA ATG GCA GCC TTG GAA GAA GAA TTC ACG TTG TCT ACT GGA 49 

Met Ala Ala Leu Glu Glu Glu Phe Thr Leu Ser Thr Gly 

3 5 10 

GTC TTA GGC GCC GGG CCT GAA GGA TTC CTA GGT GTA GAG CCC AGT GAC 97 
Val Leu Gly Ala Gly Pro Glu Gly Phe Leu Gly Val Glu Pro Ser Asp 
15 20 25 

AAG GCC GAC CAG TTC CTA GTG ACC GAC AGA GGC AGG ACC GTC GTC CTC 145 
Lys Ala Asp Gin Phe Leu Val Thr Asp Arg Gly Arg Thr Val Val Leu 
30 35 40 45 

TAC AAG GTT TCT GAC CAG AAA CCC TTG GGC AGC TGG TCG GTG AAA CAA 193 
Tyr Lys Val Ser Asp Gin Lys Pro Leu Gly Ser Trp Ser Val Lys Gin 

50 55 60 

GGC CAG ACT ATA ACA TGT CCT GCT GTG TGC AAC TTT CAG ACT GGA GAG 241 
Gly Gin Thr lie Thr Cys Pro Ala Val Cys Asn Phe Gin Thr Gly Glu 

65 70 75 

TAT ATT ATG GTA CAT GAC CAT AAG GTT TTG AGA ATA TGG AAT AAT GAA 289 
Tyr He Met Val His Asp His Lys Val Leu Arg He Trp Asn Asn Glu 
80 85 90 

GAC GTA AAC CTG GAT AAA GTA TTC AAA GCT ACA CTG TCA GCT GAG GTC 337 
Asp Val Asn Leu Asp Lys Val Phe Lys Ala Thr Leu Ser Ala Glu Val 



-47- 



R! INSTITUTE SHFPT fRULF 



WO 98/53071 



PCT/US98/10547 



95 100 105 

CAT AGG ATC CAC TCA GTA CAA AGA ACA GAA CCC CTG GTG CTG TTC CGA 385 
His Arg lie His Ser Val Gin Arg Thr Glu Pro Leu Val Leu Phe Arg 
110 115 120 125 

AGA GGC GCT GCT CGT GGG TTA GAG GCC TTG CTT GCA GAG CCA CAG CAG 433 
Arg Gly Ala Ala Arg Gly Leu Glu Ala Leu Leu Ala Glu Pro Gin Gin 

130 135 140 

ii 

AAC ATT GAA GCT GTC ATA CCA GAT GAG GAA GTG ATC GAG TGG TCA GAG 481 
P Asn lie Glu Ala Val lie Pro Asp Glu Glu Val lie Glu Trp Ser Glu 

145 150 155 

GTT TTC ATG TTA TTT AAG CAA CCG GTT TTA ATT TTT ATT ACT GAA AAT 529 
Val Phe Met Leu Phe Lys Gin Pro Val Leu lie Phe lie Thr Glu Asn 
160 165 170 

CAT GGG AGT TAT GTT GCT TAT GTA CAA TTG TGC AAA TCA CAC AGC TTA 577 
His Gly Ser Tyr Val Ala Tyr Val Gin Leu Cys Lys Ser His Ser Leu 
175 180 185 

AGC AAA TAC ATA CTC TTA CTT GGA AAA GAA GAA AAA TCT GCT AAA CCA 625 
Ser Lys Tyr lie Leu Leu Leu Gly Lys Glu Glu Lys Ser Ala Lys Pro 
190 195 1 200 205 

AAT TTT ACT GCA CGT GTG GAT GGG AAA TTC ATC TCC CTG GTG TCG CTA 673 
Asn Phe Thr Ala Arg 1 Val Asp Gly Lys Phe lie Ser Leu Val Ser Leu 

210 215 220 

GGC TCT GAT GGG TGT GTA TAT GGA ACC TTG GTA CCA ATA TAT TCA AGT 721 
Gly Ser Asp Gly Cys Val Tyr Gly Thr Leu Val Pro lie Tyr Ser Ser 

225 230 235 

GAC ACG GAA AAC AAT 'CAG AGG TTA GTT AGA GCA TTG ATG CTC AAG ACG 769 
Asp Thr Glu Asn Asn Gin Arg Leu Val Arg Ala Leu Met Leu Lys Thr 
240 245 250 

GTT GTG TCT GGC AGT GCT CGA AAT GGT TCT GCC CTC ACC ATC CTG GAT 817 

Val Val Ser Gly Ser Ala Arg Asn Gly Ser Ala Leu Thr He Leu Asp 
25b 260 265 

CAA GAC CAC ATA GCA GTC CTG GGA CCT CCA CTT CCA GCT TCT AAG GAA 865 
Gin Asp His lie Ala Val Leu Gly Pro Pro Leu Pro Ala Ser Lys Glu 
270 275 280 285 

TGC CTC TCC ATA TGG AAC ATA AAA TTT CAG ACA TTA CAG ACA TCA AAA 913 
Cys Leu Ser lie Trp Asn lie Lys Phe Gin Thr Leu Gin Thr Ser Lys 

290 295 300 

GAG CTG CCA CAA GGA ACC AGT GGG CAA CTC TGG TAT CAT GGG GAA ATA 961 
Glu Leu Pro Gin Gly Thr Ser Gly Gin Leu Trp Tyr His Gly Glu lie 

305 310 315 

CTA TTT ATG CGA CAT GGG AAA AGT CTA ACT GTG ATT CCA TAC AAG TGC 1009 
Leu Phe Met Arg His Gly Lys Ser Leu Thr Val lie Pro Tyr Lys Cys 
320 325 330 

GAA GCA TCA TCT CTG GCG AGC GCT CTT GGG AAG CTC AAG CAT ACT CAA 1057 
Glu Ala Ser Ser Leu Ala Ser Ala Leu Gly Lys Leu Lys His Thr Gin 
335 340 345 

GAG TCA GGC AGT CAT TCC GTG CCC CAT TTT GTA AAC TGG GAG ACG TGT 1105 
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Glu Ser Gly Ser His Ser Val Pro His Phe Val Asn Trp Glu Thr Cys 
350 355 360 365 

TCA GAA TAT GAA CTT GGG TCC TAC AGT GCA AAG CAG ACA AGA ACT CTT - 1153 
Ser Glu Tyr Glu Leu Gly Ser Tyr Ser Ala Lys Gin Thr Arg Thr Leu 

370 375 380 

AGA AAA AAA ACT GAA ACA AAT TTA CAC CCA GAA GTT CCA GGA GTC AAA 1201 
Arg Lys Lys Thr Glu Thr Asn Leu His Pro Glu Val Pro Gly Val Lys 
,f 385 390 395 

1 GAA CTT TTA TCA ATA ATA AAG AAA GAT TCA GAA AAG CAC ATT GAA GTA 1249 
Glu Leu Leu Ser lie lie Lys Lys Asp Ser Glu Lys His lie Glu Val 
400 405 ' 410 



GAA CTA CGT AAG TTT TTG GCT AAG TCG ACA CCT GAC TTT CAT ACT ATA 1297 
Glu Leu Arg Lys Phe t Leu Ala Lys Ser Thr Pro Asp Phe His Thr lie 
415 , , 420 425 

ATT GGA GAC ATA GTG TCT GGA CTT CTG GGA AGA TGT AAA GTG GAA CCA 1345 
lie Gly Asp lie Val Ser Gly Leu Leu Gly Arg Cys Lys Val Glu Pro 
430 435 440' 445 

TCG TTT TAC CCC CGG AAC TGT CTG ATG CAG CTC ATC CAG ACA CAC GTG 1393 
Ser Phe Tyr Pro Arg Asn Cys Leu Met Gin Leu He Gin Thr His Val 

450 455? 460 

CTT TCC TAC AGC TTA 1 TGC CCT GAC TTG ATG GAG ATT GCC CTA GAG CAC 1441 
Leu Ser Tyr Ser Leu Cys Pro Asp Leu Met Glu He Ala Leu Glu His 

465 470 475 

ACA GAT GTG CAG ATG TTA CAG CTG TGT CTG CAG CAG TTC CCT GAC ATT 1489 
Thr Asp Val Gin Met Leu Gin Leu Cys Leu Gin Gin Phe Pro Asp He 
480 485 490 

CCT GAG TCC ACC ACC TGT GCT TGC TTA AAA CTT TTC CTG AGC ATT GGT 1537 
Pro Glu Ser Thr Thr Cys Ala Cys Leu Lys Leu Phe Leu Ser He Gly 
495 500 505 

GAT GAC TGT CTT CGG GAC AGT AAT GTC AGC ATG GAG TCA GTT TTT GAT 1565 
Asp Asp Cys Leu Arg Asp Ser Asn Val Ser Met Glu Ser Val Phe Asp 
510 515 520 525 

TAT GGT GAT ACT ACA CAA GAT GAG AAA AAG GAA ATG GAA GAG CAA ACT 1633 
Tyr Gly Asp Thr Thr Gin Asp Glu Lys Lys Glu Met Glu Glu Gin Thr 

530 535 540 

GAA ATT GTT CAG AAT GGC TTC GGT CCT GAA GAT GGT GGC TGC AGT AAA 1681 
Glu He Val Gin Asn Gly Phe Gly Pro Glu Asp Gly Gly Cys Ser Lys 

545 550 555 

GAT AGT CAG CAG TTG AAC AAA AAG CCT GGT GAC ACA GCA CAG GAG CCC 1729 
Asp Ser Gin Gin Leu Asn Lys Lys Pro Gly Asp Thr Ala Gin Glu Pro 
560 565 570 

ATC TCC TTC CCT GTG ACC TTG TGT CCT GTG GCA CCA AAG CGA GCA GCT 1777 
He Ser Phe Pro Val Thr Leu Cys Pro Val Ala Pro Lys Arg Ala Ala 
575 580 585 

CTG CTA AAT GCA GTC CTT CAT TCA GCA TAC AGT GAG CCC TTC CTC CTG 1825 
Leu Leu Asn Ala Val Leu His Ser Ala Tyr Ser Glu Pro Phe Leu Leu 
590 595 600 605 
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CCG CAC TTG AAG GAC ATC CCT GCG AAG CAT ATA ACG CTG TTT CTG CAG 
Pro His Leu Lys ,Asp lie Pro Ala Lys His He Thr Leu Phe Leu Gin 

610 615 620 

TAT TTG TAT TTC CTC TAT TTG AAG TGC ACT GGC AGT GCT ACC ATG ACT 
Tyr Leu Tyr Phe Leu Tyr Leu Lys Cys Thr Gly Ser Ala Thr Met Thr 

625 630 635 

CTC CCT GGA GTA AGC CCT CCA ACC GTG AGC CAG ATC ATG GAT TGG ATA 
Leu Pro Gly Val Ser Pro Pro Thr Val Ser Gin He Met Asp Trp He 
640 645 650 

TGC CTA CTT CTA GAT GCT AAT TTT ACT GTC GTC TTA ATG ATA CCA GAA 
Cys Leu Leu Leu Asp Ala Asn Phe Thr Val Val Leu Met He Pro Glu 
655 660 665 

GCA AAA AGA CTT TTG CTT AAT CTT TAC AAT TTT GTG, AAA TCG CAG ATC 
Ala Lys Arg Leu Leu Leu Asn Leu Tyr Asn Phe Val Lys Ser Gin He 
"0 675 680 685 

TCC ATC TAC TCC GAG CTC AAC AAG ATT GAA GTC AGC TTC CGG GAG CTG 
Ser He Tyr Ser Glu Leu Asn Lys He Glu Val Ser Phe Arg Glu Leu 

690 695 700 

CAG AGG TTA AAT CAG GAG AAG AGC AGT AGA GGA CTG TAC TCC ATC GAA 
Gin Arg Leu Asn Gin Glu Lys Ser Ser Arg Gly Leu Tyr Ser He Glu 

705 710 715 

GTT CTG GAA CTT TTC TGACAGCCAG AGACAGTCTT ACTGGTTGTC CTAGAACTGG 
Val Leu Glu Leu Phe 
720 



1873 



1921 



1969 



2017 



2065 



2113 



2161 



2216 



ATATATAGAC 


CAAGCTGGCC 


TCAAACTCAC 


AGAGATCCAA 


CCCAGAGCAC 


TGCGATTAAA 


2276 


GGTGTGGCCA 


CCACACCTGG 


CTTAATGGAT 


TTTTTTATAT 


GAATGTACCT 


GTTCATCCAG 


2336 


GGGAGAAAGT 


TTTGTTTGTA 


TCCATCTCAG 


TGTCCAGAGA 


GGAGTGTGAG 


TCTCTGCACC 


2396 


ATCACATACC 


ACCATTCCTC 


AGTGCTACGG 


GTTCCACAGA 


CATCACTTCT 


GGGTTGCAGA 


2456 


CATCACTGGA 


AGAATGTTAC 


AAACTTTTTA 


AACATGGAAT 


TGAGTTGATT 


TTAAGTAAAC 


2516 


TTATTTGTGT 


ACTGATAAAA 


A 








2537 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 722 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ala Ala Leu Glu Glu Glu Phe Thr Leu Ser Thr Gly Val Leu Gly 
1 5 10 15 

Ala Gly Pro Glu Gly Phe Leu Gly Val Glu Pro Ser Asp Lys Ala Asp 

20 25 30 

Gin Phe Leu Val Thr Asp Arg Gly Arg Thr Val Val Leu Tyr Lys Val 
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35 40 45 

Ser Asp Gin Lys Pro Leu Gly Ser Trp Ser Val Lys Gin Gly Gin Thr 
50 55 60 

lie Thr Cys Pro Ala Val Cys Asn Phe Gin Thr Gly Glu Tyr lie Met 
65 70 75 80 

Val His Asp His Lys Val Leu Arg He Trp Asn Asn Glu Asp Val Asn 

85 90 95 

Leu Asp Lys Val Phe Lys Ala Thr Leu Ser Ala Glu Val His Arg He 

100 105 110 

His Ser Val Gin Arg Thr Glu Pro Leu Val Leu Phe Arg Arg Gly Ala 
115 120 ' 125 

Ala Arg Gly Leu Glu Ala Leu Leu Ala Glu Pro Gin Gin Asn He Glu 
130 135 140 

Ala Val He Pro Asp Glu Glu Val He Glu Trp Ser Glu Val Phe Met 
145 150 155 i 160 

Leu Phe Lys Gin Pro Val Leu He Phe He Thr Glu Asn His Gly Ser 

165 170 175 



i 



Tyr Val Ala Tyr Val Gin Leu Cys Lys Ser His Ser Leu Ser Lys Tyr 

180 185 , 190 

lie Leu Leu Leu Gly Lys Glu Glu Lys Ser Ala Lys Pro Asn Phe Thr 
195 200 205 

Ala Arg Val Asp Gly Lys Phe He Ser Leu Val Ser Leu Gly Ser Asp 
210 215 220 

Gly Cys Val Tyr Gly Thr Leu Val Pro He Tyr Ser Ser Asp Thr Glu 
225 230 235 240 

Asn Asn Gin Arg Leu Val Arg Ala Leu Met Leu Lys Thr Val Val Ser 

24B 250 25E 

Gly Ser Ala Arg Asn Gly Ser Ala Leu Thr He Leu Asp Gin Asp His 

260 265 270 

He Ala Val Leu Gly Pro Pro Leu Pro Ala Ser Lys Glu Cys Leu Ser 
275 280 285 

He Trp Asn He Lys Phe Gin Thr Leu Gin Thr Ser Lys Glu Leu Pro 
290 295 300 

Gin Gly Thr Ser Gly Gin Leu Trp Tyr His Gly Glu He Leu Phe Met 
305 310 315 320 

Arg His Gly Lys Ser Leu Thr Val He Pro Tyr Lys Cys Glu Ala Ser 

325 330 335 

Ser Leu Ala Ser Ala Leu Gly Lys Leu Lys His Thr Gin Glu Ser Gly 

340 345 350 

Ser His Ser Val Pro His Phe Val Asn Trp Glu Thr Cys Ser Glu Tyr 
355 360 365 

Glu Leu Gly Ser Tyr Ser Ala Lys Gin Thr Arg Thr Leu Arg Lys Lys 
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370 



375 



360 



Thr Glu Thr Asn Leu His Pro Glu Val Pro Gly Val Lys Glu Leu Leu 
385 390 , 395 400 

i i 

Ser He He Lys Lys Asp Ser Glu Lys His He Glu Val Glu Leu Arg 

405 410 415 



Lys Phe Leu Ala Lys Ser Thr Pro Asp Phe His Thr He He Gly Asp 

420 425 430 

He Val Ser Gly Leu Leu Gly Arg Cys Lys Val Glu Pro Ser Phe Tyr 
435 440 445 



Pro Arg Asn Cys Leu Met Gin Leu He Gin Thr His Val Leu Ser Tyr 
450 455 460 



1 f 



Ser Leu Cys Pro Asp, Leu Met Glu He Ala Leu Glu His Thr Asp Val 
465 470 475 480 



Gin Met Leu Gin Leu Cys Leu Gin Gin Phe Pro Asp He Pro Glu Ser 

485 490 495 

i 

Thr Thr Cys Ala Cys Leu Lys Leu Phe Leu Ser He Gly Asp Asp Cys 

500 505 510 

Leu Arg Asp Ser Ash Val Ser Met Glu Ser Val Phe Asp Tyr Gly Asp 

515 1 ' 520 525 

Thr Thr Gin Asp Glu Lys Lys Glu Met Glu Glu Gin Thr Glu He Val 
530 535 540 

Gin Asn Gly Phe Gly Pro Glu Asp Gly Gly Cys Ser Lys Asp Ser Gin 
545 550 555 560 

Gin Leu Asn Lys Lys Pro Gly Asp Thr Ala Gin Glu Pro He Ser Phe 

565 570 575 

Pro Val Thr Leu Cys Pro Val Ala Pro Lys Arg Ala Ala Leu Leu Asn 

580 585 590 

Ala Val Leu His Ser Ala Tyr Ser Glu Pro Phe Leu Leu Pro His Leu 
595 600 605 

Lys Asp He Pro Ala Lys His He Thr Leu Phe Leu Gin Tyr Leu Tyr 
610 615 620 

Phe Leu Tyr Leu Lys Cys Thr Gly Ser Ala Thr Met Thr Leu Pro Gly 
625 630 635 640 

Val Ser Pro Pro Thr Val Ser Gin He Met Asp Trp He Cys Leu Leu 

645 650 655 

Leu Asp Ala Asn Phe Thr Val Val Leu Met He Pro Glu Ala Lys Arg 

660 665 670 

Leu Leu Leu Asn Leu Tyr Asn Phe Val Lys Ser Gin He Ser He Tyr 
675 680 685 

Ser Glu Leu Asn Lys He Glu Val Ser Phe Arg Glu Leu Gin Arg Leu 
690 695 700 



Asn Gin Glu Lys Ser Ser Arg Gly Leu Tyr Ser He Glu Val Leu Glu 
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1 



705 710 715 ' 720 

i 

Leu Phe 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1305 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

! 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENf>E:i NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: HW016 ' 1 

(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION: 385.. 748 

<D) OTHER, INFORMATION: /label= SAC 24530 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: , 



ATTATTCTCC 


CCTTGTCATT 


AAAAGAAATC 


AGCAAAACCA 


AACAACTGGC 


TACACCACGA 


60 


ATTGTCGTTA 


AATTTGCTAA 


CTGGTGTCTA 


AAAGCCGTGT 


AGCTACCTCG 


GTCCTGCTTG 


120 


CTAGGTTTGC 


CACTAGAAGG 


AAGCATACTT 


AAAACAATGG 


CTACTTGGAT 


CCTCAGGGAG 


180 


ATCCTGTCTG 


CAGTCGCGTG 


GTCACCCTTA 


GCTTCATCAA 


AGCACTAACA 


GCTCACCCGG 


240 


CCAGGCTTCA 


TGAGCACTGA 


CCCTCAAGCA 


AGCAGGTTTA 


TTAAACATTT 


AGATGCCAAC 


3 00 


CTCACTTACT 


GTTTCCTGCA 


GTCATGGAGA 


GTTTACTTAA 


CAAGTTTGTA 


AATAATAAAC 


360 


TGGCACTTTG 


CACACAGACT 


TGGTACTATC 


CTAGGGGAAG 


GCCTGCTTTA 


TTTGGTTTCT 


420 


AGACCGAGTA 


GGAAGTGATC 


CATTTACCAC 


TGAGGGCAGC 


CCCATTCAGA 


GTCTTAAGTG 


480 


ACTAAGCCAG 


TGTTGAACAA 


GCAATTTCCA 


GGCTTTGTTC 


TTCAGGGAAC 


TTCCCATCAG 


540 


CTTTGAAGTC 


GGTCCTGTGC 


ACCCTAGGCA 


CATGGATCAG 


TTCACAAGTG 


GGGTTCAGTG 


600 


GAGAGAACTT 


CCCCCTCAGA 


AGTCACTTGA 


AACTTAGATG 


AGATTTGGGA 


CACTTGCTGG 


660 


TTGACTCTGT 


CTCATTTGTG 


TAAAAAGTAG 


TTTNTNTNAA 


NANTNGGTTT 


TNTTTTTTTT 


720 


TTCAAGGTTA 


TACTTTGTCC 


CATTCCTANG 


TTANTACAAA 


GTCTTGAAAG 


GGCCTTTGTA 


780 


GGGCTTTTTA 


ANNCAGGGTC 


TTAACTATGT 


AACTCTGGCT 


TGGCCTGGAA 


CTTGCTATGT 


840 


AGACCAGGTT 


ACCCTCAAAC 


TTGCCTGTCT 


TCCCAAATAC 


TGGGATTAAG 


GTTTCTGTGA 


900 


CCATACCTGG 


CTTTACCTGA 


TGAATTCCTA 


AACACCAGAA 


AACCAGTACT 


GTATGAGATG 


960 
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TTAATGTGTG TTCCTTTCAG ACTGGAGTAC AGACCAGTAG ATAACAGATA ACAGCTGGTT 1020 

CACCTTAATC TGCCTTTTTG TGTATTAATC TGTGTTTAGA GAACGGAACA ATAGCCAGAA 1080 

TTNNNCTAGC GAGTTCGAGG CCAGTTGGTG TATATGTGGG ACTCTTAACC AAAACAGCAA 1140 

GCGTTCCCTG GGGTAGTTCA CAATGATCTC CAGCTTCCTT GTTAACCAGA TAACTGCNAN 1200 

TCCAGATGTA TGACCCTGGT TGGTTTATTG TATTGATATG TTTCTGTAAT ATGAGTAAAT 1260 

TATTGTTACT TAAAAGTAAT AAATTCAAGT TTCAAAATTG AAAAA 1305 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 955 base pairs ' 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

t 

(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW017 

(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION: 393.. 627 

(D) OTHER INFORMATION: /label= SAC 24091 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



GTCGACAATC 


CTGTTATGAC 


AAGCCCGTCT 


CGATACTGAC 


AACTGAAGTG 


GAAATTGCTT 


60 


GAACATCAAG 


ATAGCTAACT 


CGGAAAGAAT 


CCCAGACTTT 


GGATAGTTTC 


TGAGTCTTCT 


120 


AGAATTTTCC 


AAGAAGAAAC 


CATGGTGGCA 


AATGCGGAGA 


CTCGGGCGGG 


CACTGGGAAC 


180 


ACAGCACAGT 


GGTCTTAGGG 


AGGTGCTTTG 


TCAGGAATGA 


ACAGTCATGG 


TTATAATCCA 


240 


CGTTTCCATT 


GCTACTCACG 


AATGATTCTC 


TTCTGTTTTG 


TTTTTAAAAT 


TTTTTTACAC 


300 


TGAATTTCTA 


TTTAGACACT 


AAAACATATA 


GGGGTGCTTG 


TCACCCGGAT 


ACATTTATCT 


360 


GTGAGCCAGC 


TATTCGGATG 


TCATGGCTGG 


GTACCTAACT 


TACTTCCATA 


TGTGAAGTGT 


420 


GCTAAACTCA 


AACCAGTTTA 


CAGAAACGAT 


GTATTTTGTG 


TATAGTAAAT 


TGTATATTCA 


480 


CCCTTTTACC 


ACGGCCGGTT 


TTTTAAACAA 


ATGAATACTC 


TAGATTTTTC 


TTCCAAATGA 


540 


GGTTACTGTT 


GGGGTGGGGT 


TGACTTAGTG 


ATGCTGTAGA 


AGGGAATCCG 


CATGCACTAA 


600 


AAAGTGTGTC 


TGCCTAAAAG 


TGTGTACAGC 


AGGCACAACC 


TTCCTTGTGG 


ATTTCCGTCT 


660 


GCTCACTGCA 


GATCTGCCTG 


TGGTTTAGAA 


ATAGAATTCA 


AGAGCCATCA 


AGGAGTGACC 


720 


AGCTTGACAC 


CACTGCCAAA 


TTCAGAATGA 


GGAACCTTGA 


GAGAAGGAAC 


TGTCGCTCAG 


780 


CCAGAAGCAG 


ATGAAGCCAC 


TGGTCCTGGT 


TCACAGTTTA 


GTGTCATGCT 


CTGGTTTATA 


840 
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AGGATGGGCA TTACGTTCGG TGTTATCTGG TTATCTGGGA GATTCTAAAC CGAGGTGCAA 900 
GAGCTCCTAT CCTGATTCTG AGATTTGAAA TAAAGCATGA ATGAACAAAG AAAAA 955 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1198 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

•it 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW018 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 86.. 1102 

(D) OTHER INFORMATION: /product= "predicted polypeptide" 

(ix) FEATURE: 

(A) NAME /KEY: misc_£ eature 

(B) LOCATION: complement (644.. 994) 

(D) OTHER INFORMATION : ' /label= SAC_23880 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GTCGACCTGG GAGTCCGGGC AGCGAGCCCC TCGCCGTCGG GTCCCACGAA GAAACTGGTG 60 

GCTGAAGCCA TATTCTGCAG CATTC ATG GAG ATC AGT AAA AAT ATT ACG AAC 112 

Met Glu lie Ser Lys Asn lie Thr Asn 

1 5 

CCA GAA GAA CTT TGG AAA ATG AAG CCT AAG GGG AAC CTG GAA GAT GAC 160 
Pro Glu Glu Leu Trp Lys Met Lys Pro Lys Gly Asn Leu Glu Asp Asp 
10 15 20 25 

AGT TAC TCG ACT AAG GAC ACC GGA GAG ACA AGC ATG CTG AAA AGA CCG 208 
Ser Tyr Ser Thr Lys Asp Thr Gly Glu Thr Ser Met Leu Lys Arg Pro 

30 35 40 

GTG CTC TCG CCT TTG CCG CAC ACG GTC CAC GTC GAC GCC TTC GAC TGC 256 
Val Leu Ser Pro Leu Pro His Thr Val His Val Asp Ala Phe Asp Cys 

45 50 55 

CCC ACG GAG CTC CAG CAC ACG CAG GAA CTC TTT CCA AAC TGG CAA TTG 304 
Pro Thr Glu Leu Gin His Thr Gin Glu Leu Phe Pro Asn Trp Gin Leu 
60 65 70 

CCA ATT AAA GTT GCC GCC GTC CTA TCG TCT CTC ACC TTC CTG TAC ACT 352 
Pro lie Lys Val Ala Ala Val Leu Ser Ser Leu Thr Phe Leu Tyr Thr 
75 80 85 

CTT CTG AGG GAA ATC ATT TAC CCA TTA GTC GCT TCC CGC GAA CAG TAT 400 
Leu Leu Arg Glu lie lie Tyr Pro Leu Val Ala Ser Arg Glu Gin Tyr 
90 95 100 105 
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TTT TAT AAG ATC CCG ATC CTG GTC GTT AAC AAA GTC TTG CCA ATG GTC 448 
Phe Tyr Lys lie Pro lie Leu Val Val Asn Lys Val Leu Pro Met Val 

110 115 120 

TCC ATT ACC CTC TTG GCA TTG GTT TAT CTG CCA GGA GAG ATA GCG GCA 496 
Ser lie Thr Leu Leu Ala Leu Val Tyr Leu Pro Gly Glu lie Ala Ala 

125 130 135 

GTT GTG CAG CTT CGC AAT GGG ACC AAG TAC AAG AAG TTC CCA CCC TGG 544 
Val Val Gin Leu Arg Asn Gly Thr Lys Tyr Lys Lys Phe Pro Pro Trp 
140 145 150 

TTA GAC AGA TGG ATG CTG GCG AGG AAA CAA TTT GGG CTC CTC AGC TTC 592 

Leu Asp Arg Trp Met Leu Ala Arg Lys Gin Phe Gly Leu Leu Ser Phe 
155 160 165 

i 

TTC TTT GCA GTT CTG CAC GCC ATT TAC AGT CTC TCG TAC CCA ATG AGA 640 

Phe Phe Ala Val Leu His Ala lie Tyr Ser Leu Ser Tyr Pro Met Arg 
170 175 180 185 

CGG TCC TAC AGA TAC AAG CTG CTC AAC TGG GCT TAC AAA CAG GTT CAA 688 
Arg Ser Tyr Arg Tyr Lys Leu Leu Asn Trp Ala Tyr Lys Gin Val Gin 

i 190 195 200 

CAA AGC AAA GAA GAT GCC TGG GTT GAG CAT GAT GTC TG<? AGA ATG GAG 736 
Gin Ser Lys Glu Asp Ala Trp Val Glu His Asp Val Trp Arg Met Glu 

205 210 215 

ATT TAT GTG TCC CTG GGG ATT GTG GGA CTG GCC ATC TTG GCT CTC TTG 784 
lie Tyr Val Ser Leu Gly lie Val Gly Leu Ala lie Leu Ala Leu Leu 
220 225 230 

GCT GTG ACA TCT ATC CCA TCT GTG AGC GAC TCT TTA ACC TGG AGA GAA 832 
Ala Val Thr Ser lie Pro Ser Val Ser Asp Ser Leu Thr Trp Arg Glu 
235 240 245 

TTT CAC TAT ATT CAG AGC AAA CTG GGA ATT GTC TCT CTT CTT CTG GGC 880 
Phe His Tyr lie Gin Ser Lys Leu Gly lie Val Ser Leu Leu Leu Gly 

250 255 260 265 

ACG GTA CAC GCT TCG ATT TTT GCC TGG AAT AAA TGG GTA GAT ATC AGT 928 
Thr Val His Ala Ser lie Phe Ala Trp Asn Lys Trp Val Asp lie Ser 

270 275 280 

CAG TTT GTC TGG TAC ATG CCT CCG ACT TTC ATG ATA GCT GTT TTC CTT 976 
Gin Phe Val Trp Tyr Met Pro Pro Thr Phe Met lie Ala Val Phe Leu 

285 290 295 

CCA ACT GTT GTT CTG ATC TGT AAA ATT GTA CTT TGC CTG CCC TGC CTG 1024 
Pro Thr Val Val Leu He Cys Lys He Val Leu Cys Leu Pro Cys Leu 
300 305 310 

AGG AAG AAG ATA CTG AAG ATT AGA TGT GGT TGG GAA GAT GTC CGC AAA 1072 
Arg Lys Lys He Leu Lys He Arg Cys Gly Trp Glu Asp Val Arg Lys 
315 320 325 

GTT AAC AGG ACT GAG ATG GCC TGC AGG TTG TAGAACTACT GTTTGCACAT 1122 
Val Asn Arg Thr Glu Met Ala Cys Arg Leu 
330 335 

AATTGCTTAA TACCGATGTC TTATAAACAT TTCAAACCTG TGTCTGTTAA TAAAATGAAT 1182 
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11 



1198 



ACTTCTGGAT CAAAAA 



2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 339 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

TlA m hT . K Rn p r o Glu Glu Leu Trp Lys Met 
Met Glu He Ser Lys Asn He Thr Asn Pro ^iu* ^ 



Lys Pro Lys Gly Asn Leu 

20 



10 

Glu Asp Asp Ser Tyr Ser Thr Lys Asp Thr 
25 3 V 



Gly Glu Thr Ser Met Leu Lys Arg Pro Val Leu Ser Pro Leu Pro His 



35 

Thr Val His Val Asp 
50 



40 

Ala Phe Asp Cys Pro Thr Glu Leu Gin His Thr 
55 60 



m oi« ran Pm He Lvs Val Ala Ala Val 
Gin Glu Leu Phe Pro Asn Trp Gin Leu Pro lie uys 8Q 

65 70 
Leu Ser Ser Leu Thr Phe Leu Tyr Thr Leu Leu Arg Glu lie lie Tyr 

85 

Pro Leu Val Ala Ser Arg Glu Gin Tyr Phe Tyr Lys He Pro lie Leu 



100 



Val Val Asn Lys Val Leu Pro Met Val Ser He Thr Leu Leu Ala Leu 

115 120 



^-i r*i„ ti* Ala Ala Val Val Gin Leu Arg Asn Gly 
Val Tyr Leu Pro Gly Glu lie Ala Aia vax ^ 

130 13 J 

Thr Lys Tyr Lys Lys Phe Pro Pro Tr P Leu Asp Arg Trp Met Leu Ala 

150 A " 



Arg Lys Gin Phe Gly Leu Leu Ser Phe Phe Phe Ala Val Leu Hi. Ala 

165 

He Tyr Ser Leu Ser Tyr Pro Met Arg Arg Ser Tyr Arg Tyr Lys Leu 

180 185 
Leu Asn Trp Ala Tyr Lys Gin Val Gin Gin Ser Lys Glu Asp Ala Trp 

toe 2UU 



, m » « rin Tie Tvr Val Ser Leu Gly He 
Val Glu His Asp Val Trp Arg Met Glu lie ryr 



210 



215 



Val Gly Leu Ala He Leu Ala Leu Leu Ala Val Thr Ser He Pro Ser 



225 



230 



Val Ser Asp Ser Leu Thr Trp Arg Glu Phe His Tyr He Gin Ser Lys 

245 " u 
LeU Gly He Val Ser Leu Leu Leu Gly Thr Val His Ala Ser He Phe 
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260 265 270 

Ala Trp Asn Lys Trp Val Asp lie Ser Gin Phe Val Trp Tyr Met Pro 
275 280 285 

Pro Thr Phe Met lie Ala Val Phe Leu Pro Thr Val Val Leu lie Cys 
290 295 300 

Lys lie Val Leu Cys Leu Pro Cys Leu Arg Lys Lys He Leu Lys He 
ii 305 310 315 320 

targ Cys Gly Trp Glu Asp Val Arg Lys Val Asn Arg Thr Glu Met Ala 

325 330 335 

i ■ i 
Cys Arg Leu , 

(2) INFORMATION FOR 1 SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1801 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW032 

(ix) FEATURE: 

(A) NAME/ KEY: misc__f eature 

(B) LOCATION: complement (1412.. 1793) 
(D) OTHER INFORMATION: /label= SAC_24831 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



TTTTTCCAGC 


TTTTTAAGTT 


GCATTTATTT 


TTGATAATCT 


GTACTGTAGC 


CAATCAATTC 


60 


TATTCTCCAT 


TCTTGCTTAA 


TTGTATTAGC 


CATAAAGGAG 


TTCAGGTTCA 


TAGGACTCAG 


120 


TTTCTCTATT 


AAACCGTCAT 


TGATTATGGA 


AATACTGTCT 


TAAGCACTAT 


GCAGATTAAC 


180 


ACCATACTTT 


TCTTCCATTA 


CATCTTTCAA 


GACAGTCACT 


CACAACTCAG 


AAGACTGATA 


240 


CAGCATTAGA 


TGAAAAAAAC 


AAGATTCACA 


GATATATGTA 


GTGAAACTCA 


AACTAGGAGT 


300 


CTTAGAGATA 


AAAAAGATAA 


GTAGATATTT 


AAAATGTAGT 


CTGTTAAAGA 


TTTAATCGGT 


360 


TTAAAGTATA 


AGATTAGAAA 


GACAAAGTTT 


CTATTGTTTT 


GAAAGTTTTA 


AGTAAAGAAA 


420 


AAAAAACTGG 


AAAATATGTC 


TTTGTTGTTC 


TTTCTCCGTT 


CTTAATTACG 


AGCTATCCAA 


480 


CAGAAACCCC 


TAGGTCATAG 


TTTATGTTTC 


TGTTCTCATC 


AATGTAACTA 


GAACACTGGA 


540 


GAAAGAAAAA 


GGAAGACATA 


TTAACCAATG 


GCCGCAATCA 


CTTTTCACTT 


GCCCCACGGT 


600 


ACAAATCTGA 


TTTGGATGAA 


ATGGTTTGTG 


GCATAAAAAT 


TTTAAATTTT 


AGAAGCCAAT 


660 
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GTGf2T AG ATT 


TPTPPATGTT 

X A V~ Wl X w A A 


TGCACAAATP 


CAATTTCAAC 

W4»*» AAA %««MkW 

1 


TACTTTTACT 

A A A A A A 


TCACTAATTA 

A A CM • A A * * 


720 


ATTAAATGGA 
AX XAAAXo\3A 


TTGGAAAPAT 

X X wnnAUn X 


TCTAGAAATG 


AAATTGTGTC 

iTmtmrm A A X# A A W 


TTTAGATGGA 


GTTGGGGAGA 

1 


780 




TTTTAAATAT 


TTAGTTCAAA 


f 

TTTCATTTTC 

AAA W«* A A A A w 


ACTTCTTGGT 

*A^* A A V* A A ^J^l A 


CTTATTATTT 

X* A A «• A A ii AAA 


840 


P & f2PTTP AGT 


TGGGGPTOOP 

X V?V3ww w X www 


TTPTGTGGGG 

X A w X O X wuW 


P TT AP ATGPT 

w A X Awn X X 


GACTGTGTCA 

wnw x w x *J a wx* 


GGCAGCTGGA 

UwwitVJw X wwl 


900 


nrzHYZ ATPTAP 
uu 1 \ja 1 v* 1 


ATTY2APAGGA 

AX AwAwAwwA 


TTPAATGAPT 
x x unn x uau x 


TTPTPPTPPG 


PGAAAAGCPP 


PTGTPAGPTA 

w A w X V* /\w V* X A 


960 

J w w 


. P AP A AflG ATG 

1* 


GGAGGTTTTC 


GGCACACTGT 


PAAGTGATTG 


PAGATPCTGA 

w AwA X w w X \3r\ 


CTTCTPAAAT 

w A A w A w AAA X 


1020 

A> V <w w 


A TVS A GGTGTP 


GGCAGTGGCA 


GCAGCAGCTA 


CTGGTAGCTG 

W X ww X Aw^» X \9 


CCCCAGACTC 

1 


CTCAGATAAP 

W X wAwn X A/VW 


1080 

A, W U V 


TTAPAPAGGA 

X AAwAV-AwwA 


GCTGGTTTAT 


GCATCCCCTT 


CGTCAGAGCT 


TPPTPTGGGC 

A W A W A WW ww 


AAGTGAAAGT 


1140 


TGPAAGPCTT 

X wwAAww w X X 


CACTTGQCCC CATGTCTCAG 


CTCTCTAAGT 

W A W A v-t A onv A 


GACCCTGCCT 

w/x\^ A A 

i 


GTGCTGGCCC 

VJ A WW A WW V W W 


1200 

A> Al W w 


TTGTGPTTAT 


GGTGGTGGTG 


GTGGTGGTGC 


AAGCCTGAAG 


GAGGAAGACT 


TGTCTCAACA 

a vj a v> a wnnvn 


1260 


TPT AATGCP C 

1 l« X AA X U\r>\< V* 


CTGGTTGCTG 


ATTCTCTGAA 


GGCTTACTGC 

VS W w X X *»W X ww 


TCCCAGCAAC 

t 


CGTCAGPCTP 


1320 

A «J At U 


AGTPGPAGPP 


GGGCCTCGCT 


CCTCAACTTG 


GCAAAAATGC 


PTAPAGAGAC 


TGAGAGATGP 

x wnwnu a x w>a> 


1380 

A *J W W 


ATPGAGTPPP 
A x wunu X v» ww 


TGATTGCTGT 


( TTTCCAGAAG 


TAPAGTGGGA 

X nV« Aw X wwwA 

t 


AGGATGGAAA 

AwwA A ww AAA 


TAGPTGTPAT 

X Aw Vw X w X w A X 


1440 


PTPTPPAAAA 
w A V*. A wwtnnA 


CTGAGTTCCT 

f i 


TTCCTTCATG 


AAPAPGGAGP 


TGRPPGPPTT 

X wwwwww w X X 


PAPGAAGAAP 

wAwwAA\9AAw 


1500 

1 JvU 


PAfSAAOGAPP 
w A\J AAO wA w w 


CCGGTGTCCT 


CGACCGCATG 


A Tfl A AG A AGP 

A X UnAunnU\« 


TGGAPPTPAA 

X wwAw w X wAA 

t 


PAGTGATGGG 

w AVJ X V7A X www 


1560 

t 


P Af2PT AC ATT 
w Aval* 1 AwA X X 


TCCAAGAGTT 


TCTCAACCTT 


ATTGGTY5GPT 

A X X \Sw X www X 


TAGPTATAGP 

X Aww X A X AVJ w 


ATGPPATGAG 

A X wW w A X Urtv 


1620 


TCCTTCCTCC 


AGACTTCCCA 


GAAGCGTATC 


TAACCCTCTC 


i 

CATTCCCTTC 


CAGCCACCAA 


1680 


GTCATCGCCT 


CCTCCACTCC 


TTCCCCCATC 


CACACCTGCA 


CTGAGCCCAC 


CACACCTACC 


1740 


ACACATGCAG 


CCCACGCCTG 


ACAGGGAAAA 


TAAAACAATG 


TCATTTTTTT 


AAATGTAAAA 


1800 


A 












1801 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2646 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW033 

<ix) FEATURE: 

(A) NAME /KEY: misc_feature 

(B) LOCATION: 50 ,.263 

(D) OTHER INFORMATION: /label* SAC_23901 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
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GTCGACTTGA 


CTTCTTAGTT 


TTCTGACGGG 


AGCTCTCTTC 


CTTCTGAGTA 


CTAGTCTCGG 


60 


GCCTCCAGCC 


CCAGGAAGAC 


CCTCTGCTTT 


TCTCTCCACC 


CTTGGCTTGC 


TCCTCAGCAG 


120 


GGAAGAAGTA 


GATCACAGTC 


TCACTGCCTT 


CAGTTCCCAG 


CTTTGCCAGT 


CACCAGCTGT 


180 


GTAGTCTCAG 


GCCAATCACC 


ACCTCCTCTA 


GCCTAGGTAG 


TTGGTTTTAT 


GTGGGTGAAT 


240 


GTTTTGCCTG 


GATGTATGTA 


TGTACACCAC 


CTGTGTGCCT 


GGTGCCCCTG 


CAGTCAGAAG 


300 


AAAGTGCCAC 


ACCCCTAGGA 


ACTGGAGTTA 


AGATGGGTTA 


GAGCCTCCAT 


GTGGGTGCTG . 


360 


AGAACTGAAC 


CTGGTTCTCT 


GGAAAAGCAG 


CCGGCACTCA 


ACCTCTGAGC 


CAGCTCTTCA 


420 


TCCCCGAGTC 


CGTTTATTTA 


ACCGGCTATA 


AAATGGATTT 


AAAACGACTT 


ACCTCACCAG 


460 


GCCTGTGAAA 


CATTTAAAAA 


CTTAATAAAA 


ATTCTACACT 


+ 

TAATTGTTTT 


TGCTTATGCT 


540 


TTCTAGTCCC 


TTTAGTCTTA 


GACAGTTTCT 


GTTTCTTGTA 


TTTATAAAGT 


TGGTATTAAA 


600 


TATATAGCCA 


GTGGACAAGA 


ACCTTACTGC 


TATGGGCTGG 


CCfCTGTAGG 


GTCCTAGACT 


660 


AGAGGAGCGT 


ACTTTGTGGT 


TTAAAGTGGT 


TTGGCCTGCC 


TACCTGTTTT 


TCCCACTGGA 


720 


CTCAAAACAC 


TAATAAAGCA 


GGGGTGACCT 


GTGTCACCTG 


AGCATTGAGC 


AGAACACTGG 


780 


CCCAGGCAAT 


GGCCTCAAAG 


CTGGGCTACA 


TTATAAAGTG 


AACTTTCTTG 


TTCATACTGG 


840 


GTCACAGAAG 


CAGGCCCAGA 


AAAAATAAAA 


AATCCTGGTC 


ACTCGGTTCC 


CCCTTGCACC 


900 


ATCATCTGTT 


CACTAGTGCC 


CCTCTGACCA 


TTCTCACCCA 


1 | 

AGACAGGCAT 


CCTCTGGGAA 


960 


CCTTAGGCAG 


GTCCATTTCT 


TCTTTGTCAT 


TTGGAATGAA 


GCACTAACAG 


CCCTGGGGTC 


1020 


TCTGCCATTG 


GCTTCCCAGA 


GTTGTCCAAT 


GAGTTTTCGT 


GAAGGAATCT 


GGGTTTATCT 


1080 


TTCCCGGGCT 


TCCTGCCCTG 


TTAGCAGAAT 


TTCAAATAGC 


AGATAATAGA 


TTCTGACCTG 


1140 


TGAGGCCATT 


TTTCTCTCAA 


TCCCTTTCTT 


CTGAGACATT 


AGCCTTGGGC 


ATGCAGCAGC 


1200 


TGCGCACTGG 


TTTCAGGGAG 


GAAAGGGGTG 


AATGGTGAGC 


GAACTTGGCC 


CTGTGGATGC 


1260 


TAACTCCACT 


GAGATGAAAA 


CATGGGGGCT 


TTTCTCTGAA 


GACGCAGGGC 


CGTGTAAACA 


1320 


TATGACTTTG 


ACCATATTCT 


CAGGTTTGGG 


GATGTGGGAA 


GTTCCAGGAA 


CAGAAATCCC 


1380 


CTGCTGAGTT 


CTTGGGCACT 


GTGGCTATTT 


GTTCCCATGA 


GGACCTCTCC 


CTGCAGTCTG 


1440 


CCCTCTGGCA 


GGGTGGAGTA 


TAGAAGACTC 


AAGAGCTACA 


GATGCCCCCG 


TCACTGGAGC 


1500 


TCCCCTGCAC 


CTAGGCCATC 


AATCACGTCC 


ACCATCACCA 


GAGGGGCTTA 


GGCCTAGGGT 


1560 


GTATGCTGAG 


GGAAAGACGA 


GGGCCTGTCC 


CACTACTCGA 


GTAGAGCAGA 


TGGCCTTCTG 


1620 


TGTAGGCAGT 


GAATTTCATA 


GCAGTGCCTT 


GAGGAGCCCA 


GCATGTTCTC 


AAGCACCTCC 


1680 


TCCCAGTTTA 


TATTTGGAAG 


TCTGCTTCAC 


AGGGACCAGA 


GGGCATAAAC 


CTAACCTACC 


1740 


CACTTCCCAT 


CAGGGACAGC 


CCAAAGCACA 


CCAGTATATG 


TAAGGGGTGC 


AGGGAGGTAT 


1800 


CATGGGTCTC 


TTCTCTCCTC 


TCAGTGCTCA 


AACTCAGGTG 


GAGGCTATGG 


ATTCAGGATC 


1860 



-60- 



SUBST1TUTE SHEET (RULE 26) 



WO 98/53071 



PCT/US98/10547 



TTGGTTAGGA GTTGCAAGCA ATGGACAAGA AAGAGCATTA GGGAAACCAG AATCCCAGCC 1920 

TGTCCCTGTG CCTCCACCCC AGCTAGACTT GAGGCTTAGG TCCTGATACT CTGCCCTTAC 1980 

ACACCCTCTC TCCATTTCCC AGGAGTTAGT AAAAAACAGG CAAGGGTAAT ATGCTATCAT 2040 

CTGTGGTCCT GAGCCTTCTC CAGACTTCCC CTTCACTCCG CCGCCACTGC CACCACCACA 2100 

GAGACATCTT CGGAAGCATT AGGTTTATTT CTTGAACTGA AGTCCTGGGG AGAAGATGAC 2160 

TGGTGCCACC ACATGTGCTC TGTCTGGTTA GGGGCCTTGA GGTGGCACAG TCCAGCCATA 2220 

GAAGGATCAG CCTTCCTGCC CATACAGATT, AATGCCAGGG AGTCAGATAG TTGATGCGAC 2280 

TGTAAAGAGA AAGAGAGGTG AGATGGGGAC TCCAAGGTAC TGTAAAGAGA AATCAAGTTT 2340 

TTCTTCCCTA GCTCCATCCT CCACTCCAAA GCCTCAGGGA TCTGAAAACA GTCTTCATCC 2400 

AGTCTATGTG CCCACAACCT TCAATGGTTC CCTACTTCCT CTTGTATATT TTTGGCTATT 2460 

CTCTTCACCT CTAATTAGCT CATACTGTTT CCCACTGACT CCCGTCTTTA ATGGCTTCAT 2520 

AGGTTTTCAG TCACGTATGA ATCTACACTG CATCACACAG TTACAAAACC CAGAGTTGGG 2580 

GTTGGGGATT TAGCTCAGTG GCCTAGCAAG CGTAAGGCCC TGGGTTCGGT CCCCAGCCCC 2640 

GAAAAA 2646 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: ' > 

(A) LENGTH : 4339 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW034 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 91.. 837 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 3101.. 4162 

(ix) FEATURE: 

(A) NAME /KEY: misc.feature 

(B) LOCATION: 3561.. 3862 

(D) OTHER INFORMATION: /label* SAC_23897 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GTCGACGGAA GAGCTGTTGT GACTGCGGTA CTGCGGAGGG CTCTCGGCGG CCCGCGACAA 60 
GGCTGAGTCC CGGGAGGTCC ATGCCCCACC ATG CTT TCC TGT GAC ATC TGT GGT 114 
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Met Leu Ser Cys Asp lie Cys Gly 

1 , 5 



GAA ACT GTA ACC TCA GAA CCA GAT AGG AAG GCT CAC CTA ATT GTT CAC 
Glu Thr Val Thr Ser Glu Pro Asp Arg Lys Ala His Leu lie Val His 
10 15 20 



162 



ATG GAA AAT GAA ATT ATC TGT CCG TTT TGC AAG CTG TCG GGT ATA AAT 
Met Glu Asn Glu lie lie Cys Pro Phe Cys Lys Leu Ser Gly lie Asn 
'i 25 30 35 40 



A 



TAC AAT GAA ATG TGT TTT CAT ATT GAA ACC GCT CAT TTT GAA CAG ACT 
Tyr Asn Glu Met Cys Phe His lie Glu Thr Ala His Phe Glu Gin Thr 

45. 50 55 



210 



258 



ACG CCA GAA AAA AGC TTC GAG ACQ CTA GCC GCA GTG CAA TAT GAA AAT 
Thr Pro Glu Lys Ser Phe Glu Thr Leu Ala Ala Val Gin Tyr Glu Asn 

60 ' 65 70 



306 



TCA GAC CTC GGT AAT ACC AAG CTG CAC AGT ACA GTG GAA GTT ACC TCA 
Ser Asp Leu Gly Asn Tftr Lys Leu His Ser Thr Val Glu Val Thr Ser 
75 80 > 85 



354 



GGC ATC CAT TCA GCT TGT GCA TCG AAC TTT CCA AAG GAG TCG TCC GAA 
Gly lie His Ser Ala Cys Ala Ser Asn Phe Pro Lys Glu Ser Ser Glu 
90 95 ' 100 



402 



AGC CTT TCT AAA GAT AGG ACT TTA AAA CAA GAA GCT TTT TAT ACA GAG 
Ser Leu Ser Lys Asp Arg Thr Leu Lys Gin Glu Ala Phe Tyr Thr Glu 
105 110 - 115 120 



450 1 



AGC GTA GCT GAA TCT AGA AAG TAC CAG AAA AGC AGA GAA AAG CAG TCC 
Ser Val Ala Glu Ser Arg Lys Tyr Gin Lys Ser Arg Glu Lys Gin Ser 

125 130 135 



496 



GGA TTG TCT GAG GCC CAA GGA TCA ATT TAT GAA ACA ACA TAC AGT CCC 
Gly Leu Ser Glu Ala Gin Gly Ser lie Tyr Glu Thr Thr Tyr Ser Pro 

140 145 150 



546 



CCC GAA TGT CCG TTC TGT GGG AGA ATT GAG AGG TAC AGT CAA GAT ATG 
Pro Glu Cys Pro Phe Cys Gly Arg lie Glu Arg Tyr Ser Gin Asp Met 
155 160 165 



594 



GAA ATT CAT GTG AAA ACA AAG CAT GCC AGC CTT CTA GAA AGT CCG TTA 
Glu He His Val Lys Thr Lys His Ala Ser Leu Leu Glu Ser Pro Leu 
170 175 180 



642 



GAA GAC TGT CAT CAA CCA CTC TAT GAC TGT CCC ATG TGT GGG CTT GTC 
Glu Asp Cys His Gin Pro Leu Tyr Asp Cys Pro Met Cys Gly Leu Val 
185 190 195 200 



690 



TGT ACA AAT TAT CAC ATT CTC CAA GAA CAT GTG GAC TTG CAT TTA GAA 
Cys Thr Asn Tyr His He Leu Gin Glu His Val Asp Leu His Leu Glu 

205 210 215 



738 



GAA AGC AGC TTT CAA CAA GTA CTT GTG AGT GGG TAT GTG CCT GTG TGG 
Glu Ser Ser Phe Gin Gin Val Leu Val Ser Gly Tyr Val Pro Val Trp 

220 225 230 



786 



ACA CCC GAC CTG CGA TGT CAT CCT CAG GAA GGC CGT ACA ACT CCT TTA 
Thr Pro Asp Leu Arg Cys His Pro Gin Glu Gly Arg Thr Thr Pro Leu 
235 240 245 



834 
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AGA TGAGTTTCTT AGTATCTTGG AACACACCAG TTAAGCTCCC AAGTTTGCTT 887 
Arg 

CTATCTCCAC CTCCCTAATA ATGCTGGAGT TACAATCATG GACAACCATA CACACATGTT 947 

TCACATGATT CTGGGCACTG AGCTTAGGTC CTTATGTCTG TGAGGCAAGC TCTTCCTGAC 1007 

CGAGCTATCT CCTGAGCCTG CTGTCTGTCT CTTTAAATAA TATTAAATAC TTCATATAGA 1067 

GCTCTCGAAG GTATAGGGTC TTGTATGAGT CTCCTAGGGC CACTGAAAAA TAAAGTGACC 1127 

TCTAAGAGAA GTTGTTTATA TTTTTATTAC, TCTGTCAGTA TTTGGGAGTA TGTAGACAGC 1187 

TCCTGGTAAA GTAGGAGATT TTAGCGTTCG ATGATTTACC ACCAATAACC GTGATCTCTA 1247 

TGCATCATGT ACCTGCTTTG CTCATTACCA CAAGAGGAAA ATTAGTAGAA TATTCTTTTT 1307 

TTTCCTTTTC CTTTTTTTTC TTTTTTTCGG AGCTGGGGAC CGAACCAAGG GCCTTGCACT 1367 

CGCTAGGCAA GCGCTCTACC ACTGAGCTAA ATCCCCAACC CCAAATTAGT AGAATATTGA 1427 

t 

TGGAAGATTA TTTCACGTCA GTGGTCTTCT GCTGTCTACC CCAGTTGGCA TGCCACTGTC 1487 

GTTGTTTAGG GTTCCTGTTG CTGTAATAAA ACACTACCAA CAAGAGTAAC CTGGGCAGGA 1547 

i 

AAGGGTTTAT TTTATCCAAA CATTTCCAGT TCACCACTGA GGGAATTCAG GGAAGGGACT 1607 

CAAGGCAGGA ACCTGTAGGC AGGAGCTGAT GCAAAGGCCA ,TGAAGTGCTG CTGAGTGGCT 1667 

TGTTTTGCCC TGACTTGTTC AGACTCCTTT CTTAGAGCAC CCAGGGATAT CAGCCCAGGG 1727 

GTGGCATCAC CTACAAGGAC TGGGCCTTCC AACACCAGTC ACTACAGAAC GCACTACAGA 1787 

CTTGCCTGCA CTCGGTCTTT CTTTACTGTT GTATTGTTTT TTGAGATGGA GTCTCACTAT 1847 

GTAGCTCTGG CTAAGTGTAA ACTCTAGCTT GTGTCAAGTT GACCTAAAAC TAGCCAGCAC 1907 

AGCTATCTTC CTTTGAGTAA GATTGTAAGA GGATGCTGTG GAAGGGGTAA TTTTTTCAAA 1967 

GCAGAATGTC TGATAATTCT TACTTTAGGG ATAGGCTTTG TATTGTATTC TATAAACCAA 2027 

TGAAACAGTC TCAGGGCATG CTAAAAATCA GTTGTGCAGT ACATGCCCAA GAGTGGAAAC 2087 

TAGAACCAGT GGGAAGGTGT AAAATGGGTT AATCAACAGG TGTGTGCATT GTACTGTCTC 2147 

TGAATTGAGG AGTACCTAGA AGACTCCTAA CTGTTTCTTC TGGGTGTATC TCTGAGGATG 2207 

TTTTCAGAAT AACTGATGGG TGCGTTGGCA AACAGAATGA AGAAGATGTG CATTCAGCGT 2267 

GGGCAAGTAG TGTCCAGTGT CGCTAGTACT ATAGGTGGAC CAGGAAAGCC GAAGCAGTGG 2327 

CAGAATCACT TTGGCTCCTC CTTCTGGATG GGGCTTTCTT TCCCTGCAGC TGCTCCCTGG 2387 

GTGCAGCTGC TCCCTGGGTT ATACTCCAGG ATGTCTGACT TTTGGATGCT GGAACTCTGA 2447 

TAATCCCTAC CAGAAATTTT GAGCCTTTGA CCTCACACTG GGATATCTGA TTTTGCCAAA 2507 

CTGGTTCTGG CACTTTCAAC TTTCACTACT GGTTTCTCTG TGTCTCAGGC TATCAAGAAC 2567 

TTTTCATTTG TATCATTATG TGATCTATTC CCGTATGATT CGTGATATTT AATAAGGTTT 2627 

TCTATATCTT TCTGTCTGTC TGTCTCTGTG TGTGTGTGTA CATACACACA CACGCCTGTG 2687 
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* w A UV) 1U 1 1 i 


i I* 1 lnlui 1 \3 


i LAO 1 Lt 


lull Av- 1 viuA ubALV/L 1\»AU 


U AU X AAAbAA 




TTTAGGAAAf* 


A AC1 Af*Tf* A AT 




A wVA I AL AAVjr v,V,A 1 lunU 1 A 

! 


AAVjAA 1L1 u A 




GTTTCAGTGG 


GAGAGAGGAA 


GCATGGCATT 


ATGCTATAAA ATTATTCAGC 


TTGTTCGGGT 


2867 


AAAAACTGTA 


ATGTAATGAA 


ACAGTTGGAG 


GTTGTGAATA TATATTTTAT 


ATTATTTATT 


2927 


AGTTTTTTCT 


TTCCATTAGT 


TGTGTTCAGA 


GATATCAATG ATAAGGGTCT 


TTCAACCGCT 


2987 


CA&CCTTTGT 


TTTTCTCCCT 


AAAGGTGTTA 


GGATGTTCAC AATTGCTAGT 


TAATGAGAAA 


3047 


TGCTTGTTAG 


AGCCTAAGCA 


CTGACTCATT 


CGGGTTGTCC CTCTCCAATA 


GGC ATG 


3103 



• Met 

1 

GAC AGA GTC CAG TGT TCT AGT GAC AGA GAA TTA GCT CAC CAG CTT CAG 3151 
Asp Arg Val Gin Cys Ser Ser Asp Arg Glu Leu Ala His Gin Leu Gin 

5 10 15 

CAA GAA GAA GAG AGA AAG AGG AAA TCT GAA GAA TCA AGA CAA GAA AGG 3199 
Glh Glu Glu Glu Arg Lys Arg Lys Ser Glu Glu Ser Arg Gin Glu Arg 
20 25 30 

GAA GAG TTT CAG AAA TTG CAG CGG CAG TAG GGC TTA GAT AAT TCT GGA 3247 
Glu Glu Phe Gin Lys Leu Gin Arg Gin Tyr Gly Leu Asp Asn Ser Gly 
35 1 40 45 

GGA TAC AAA CAA CAG CAG CTG CGA CAC , ATG GAG CTA GAA GTA ACG AGG 3295 
Gly Tyr Lys Gin Gin Gin Leu Arg His Met Glu Leu Glu Val Thr Arg 
50 55 60 65 

GGA CGA ATG CAT CCA TCT GAG TTC CAC AGC AGA AAA GCT GAC ATG TTG 3343 
Gly Arg Met His Pro Ser' Glu Phe His Ser Arg Lys Ala Asp Met Leu 

70 75 80 

GAA TCA ATC GCT GTT GGT ATT GAT GAT GGA AAA ACA AAA ACT TCT GGA 3391 
Glu Ser lie Ala Val Gly lie Asp Asp Gly Lys Thr Lys Thr Ser Gly 

85 90 95 

ATT ATT GAA GCC CTC CAC AGG TAT TAT CAG AAC ATT GCC ACA GAT GTA 3439 
lie He Glu Ala Leu His Arg Tyr Tyr Gin Asn He Ala Thr Asp Val 
100 105 110 

AGG TGT GTG TGG CTG TCT ACA GTG GTG GAT CAC TTT CAT TCA TCT TTT 3487 
Arg Cys Val Trp Leu Ser Thr Val Val Asp His Phe His Ser Ser Phe 
115 120 125 

GGG GAC AAA GGT TGG GGT TGC GGT TAT AGA AAT TTC CAG ATG CTG CTT 3535 
Gly Asp Lys Gly Trp Gly Cys Gly Tyr Arg Asn Phe Gin Met Leu Leu 
130 135 140 145 

TCA TCA TTA CTG CAG AAT GAA GTG TAC AGT GAC TGC TTG AAA GGT ATG 3583 
Ser Ser Leu Leu Gin Asn Glu Val Tyr Ser Asp Cys Leu Lys Gly Met 

150 155 160 

TCA GTT CCT TGT ATT CCA AAA ATT CAG TCC ATG ATT GAA GAT GCG TGG 3631 
Ser Val Pro Cys He Pro Lys He Gin Ser Met He Glu Asp Ala Trp 

165 170 175 

AAT GAA GGT TTT GAT CCT CAG GGG GCC TCT CAA CTT AAT AAC AAG TTG 3679 
Asn Glu Gly Phe Asp Pro Gin Gly Ala Ser Gin Leu Asn Asn Lys Leu 
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180 185 190 

CAG GGG ACC AAG GCC TGG ATT GGA GCA TGT GAG ATC TAT ACA CTT CTG 3727 
Gin Gly Thr Lys Ala Trp lie Gly Ala Cys Glu lie Tyr Thr Leu Leu < 
195 200 . 205 

i 

ACC TCA CTG AAA GTC AAG TGC CGC ATT ATT GAT TTT CAC AAG TCA ACT 3775 
Thr Ser Leu Lys Val Lys Cys Arg lie lie Asp Phe His Lys Ser Thr 
210 215 220 225 

GGT CCT TCA GGC ACA CAC CCT CGC TTA TTC GAG TGG ATA CTG AAC TAT 3823 
Gly Pro Ser Gly Thr His Pro Arg Leu Phe Glu Trp He Leu Asn Tyr 

230 235 240 

TAT TCC TCA GAG ACG GAA GGG GCG CCA AAG GTC GTG TGT ACA TCC AAA 3871 
Tyr Ser Ser Glu Thr Glu Gly Ala Pro Lys Val Val Cys Thr Ser Lys 

245 , , , 250 255 

i < 

CCT CCT GTC TAT CTC CAG CAT CAG GGT CAC AGT CGA ACA GTT GTT GGG 3919 

Pro Pro Val Tyr Leu Gin His Gin ,Gly His Ser Arg Thr Val Val Gly 
260 - 265 ' 270 

ATT GAA GAG AGA AAA AAC CGA ACA TTA TGT TTG CTA ATA TTT GAT CCT 3967 
He Glu Glu Arg Lys Asn Arg Thr Leu Cys Leu Leu He Phe Asp Pro 
275 , 280 ' 285 

GGA TGT CCT TCT C^A GAA ATG CAG AAG CTG TTA AAG CAA GAC ATG GAG 4015 
Gly Cys Pro Ser Arg Glu Met Gin Lys Leu Leu Lys Gin Asp Met Glu 
290 295 300 305 

GCT GGT AGT CTC AGG CAG CTC CGG AAA TGT GTG GGA AAT CTG AAG CAT 4063 
Ala Gly Ser Leu Arg Gin Leu Arg Lys Cys Val Gly Asn Leu Lys His 

310 315 320 

AAG CAG TAC CAG ATA GTA GCA GTG GAG GGT ATC CTG TCC CCA GAG GAG 4111 
Lys Gin Tyr Gin He Val Ala Val Glu Gly He Leu Ser Pro Glu Glu 

325 330 335 

AGA GCT GCC AGG AAA CAA OCT TCT CAA GTG TTT ACA GCG GAG AAG ATT 4159 

Arc Ala Ala Arc Lys Gin Ala Ser G3n Va3 Phe Thr Ala Glu Lys lie 
340 345 350 

CCT TGACCCAGAT ATTTTGTAAT TATCCTTTTT GTTTCCGATA TTGAACTCTG 4212 
Pro 



ATACAGTTGA AGAATTTGAC TTCACTGAAG TCGGTGATAA ATTATTTAAG TTATAAATGC 4272 
CTGTTGTACT CCTTAGCATT TCGTTGTCTG GTAATCAGTT GAATAAATTC CTTGTTTACA 4332 
TTAAAAA 4339 

(2) INFORMATION FOR SEQ ID NO: 18: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 249 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
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Met Leu Ser Cys Asp He Cys Gly Glu Thr Val Thr Ser Glu Pro Asp 
15 10 15 

Arg Lys Ala His Leu He Val His Met Glu Asn Glu He He Cys Pro 

20 25 30 

Phe Cys Lys Leu Ser Gly He Asn Tyr Asn Glu Met Cys Phe His He 
35 40 45 

Glu Thr Ala His Phe Glu Gin Thr Thr Pro Glu Lys Ser Phe Glu Thr 
50 55 60 

Leu Ala Ala Val Gin Tyr Glu Ash Ser Asp Leu Gly Asn Thr Lys Leu 

65 70 75 80 

. i 

i 

His Ser Thr Val Glu Val Thr Ser Gly He His Ser Ala Cys Ala Ser 

85 90 ' 95 

Asn Phe Pro Lys Glu Ser Ser Glu Ser Leu Ser Lys Asp Arg Thr Leu 

100 105 110 

Lys Gin Glu Ala Phe Tyr Thr Glu Ser Val Ala Glu Ser Arg Lys Tyr 
115 120 125 

Gin Lys Ser Arg Glu Lys Gin Ser Gly Leu Ser Glu Ala Gin Gly Ser 
130 135 140 

lie Tyr Glu Thr Thr Tyr Ser Pro Pro Glu Cys Pro Phe Cys Gly Arg 
145 150 155 160 

He Glu Arg Tyr Ser Gin Asp Met Glu He His Val Lys Thr Lys His 

165 170 175 

Ala Ser Leu Leu Glu Ser Pro Leu Glu Asp Cys His Gin Pro Leu Tyr 

180 185 190 

Asp Cys Pro Met Cys Gly Leu Val Cys Thr Asn Tyr His He Leu Gin 
195 200 205 

Glu His Val Asp Leu His Leu Glu Glu Ser Ser Phe Gin Gin Val Leu 
210 215 220 



Val Ser Gly Tyr Val Pro Val Trp Thr Pro Asp Leu Arg Cys His Pro 
225 230 235 240 

Gin Glu Gly Arg Thr Thr Pro Leu Arg 

245 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Asp Arg Val Gin Cys Ser Ser Asp Arg Glu Leu Ala His Gin Leu 
15 10 15 
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_ „ Tvfi Ser Glu Glu Ser Arg Gin Glu 
Gin Gin Glu Glu Glu Arg Lys Arg Lys Ser Glu o ^ 

20 

Arg Glu Glu Phe Gin Lys Leu Gin Arg alb Tyr Gly Leu Asp Asn Ser 



50 55 



oly Gly Tyr Lys Gin Gin Gin Leu Arg His Met Glu Leu Glu Val Thr 

Glu Phe His Ser Arg Lys Ala Asp Met 
torg Qiy *tg ''"='- *70 75 

2 Glu Ser II. Al. Val Gly He Asp Asp G!y Lys Thr Lys Thr S.r 

81y He He Glu Ala Leu »i» «. Jg ^ Gin As* He Al. Thr ASP 

V.1 Ar, CVS Va! Trp Leu Ser Thr -1 val Asp Hi. Phe Hi. Ser Ssr 

115 120 , 

P„e ffi «P Lys Gly Trp Gly CVS Gly Tyr *. » P»e «» "* - 

M Z sex Leu Leu ?1 n .sn Glu Val Tyr Ser Asp Cy. Leu Lys «g 

145 15 

„ et ser Val Pro cys He Pro ,y. He Gin Ser Het He Glu »p A!a 

165 t \ 

m r*-\*, &1a qpr Gin Leu Asn Asn Lys 
Trp Asn Glu Gly Phe Asp Pro Gin Gly Ala Ser Gin ^ 

180 

L .u Gin Gly Thr Ly. Ala Trp II; Gly Ala Cy. Glu lie Tyr Thr Leu 
L» Thr s" Leu Lys Val Lys cys Ar, lie He Asp Phe His Lys Ser 

210 

, * *»r Glv Thr His Pro Arg Leu Phe Glu Trp He Leu Asn 
Thr Gly Pro Ser Gly inr nx* 235 2 4u 

225 

^ ^ ser ser Glu Thr Glu Gly Ala Pro Lys Val Val Cy. Thr S.r 
Ly . Pro Pro V.1 W r Leu Gl» Hi. Gin G!y His Ser Ar„ Thr Val Va! 

01 y He Glu Z Ar 9 Lys Asn Ar ? Thr Leu Cy. Leu Leu He Phe Asp 

275 

o - Glu Met Gin Lys Leu Leu Lys Gin Asp Met 
Pro Gly Cys Pro Ser Arg Glu Met uxn y jqq 

290 

G1 u Ala Gly ser Leu Ar, Gin Leu Ar g Lys cy. val Gly Asn Leu Lys 

305 

His Lys Gin Tyr Gl» He Val Al. val Glu Gly He Leu Ser Pro Glu 

32b 

ala Ser Gin Val Phe Thr Ala Glu Lys 
Glu Arg Ala Ala Arg Lys Gin Ala Ser Gin ^ 

340 J * 3 
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lie Pro 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2415 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

■ 

(iii) HYPOTHETICAL: NO . 



( vi i ) IMMEDIATE SOURCE : 
(B) CLONE: HW035 

(ix) FEATURE: 

(A) NAME /KEY: misc_£ eature • . 

(B) LOCATION: complement (1738.. 1910) 
(D) OTHER INFORMATION: /label* SAC 24468 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

i , 



GTCGACGGGC 


CCTATTCTGC 


CTCCTTCAGG 


ATCTGCGTCT 


TCCAGGGATC 


CGCTGTTCGA 


60 


CTTGGAGGTG 


CTTCTCTGCA 


GGTCCCAAGC 


CTGCCTCCTC 


ACCCCAGCTT 


TCCAGTGGAG 


120 


GCTGCCCCCT 


TCCAGGCTTG 


GCTCAGGAAG 


CTCCTCATAG 


GCGGATGCCC 


ACCCATAAGA 


180 


GGCTACATCC 


CTTTGTGGTC 


ACATGGCCTG 


TAGCCTCTTG 


CTGACATTGC 


TGGTGTCTTC 


240 


TCTGGGGCCC 


TGGATCCCTG 


ACAAGCACAG 


AAATGTGATC 


TGGTAAAAGG 


AAAGACTGTG 


300 


GCCTGATAGG 


CCTTTACCTC 


GTAGGCAGCA 


GGATTTTCAC 


TCAGTTTGAC 


ACAGCATGCA 


360 


GTGCTCTGGG 


AGCAGGACCT 


CTGCCACCCT 


GGCATTTTTG 


GTAGGCTGGT 


TTGGCTCAAG 


420 


GTGCTCAGAC 


ACTCCCATTG 


ATGCCTGAGT 


CGGATATGGG 


TGGAAATCTG 


GACTGTTGAT 


480 


TATAAGTTAC 


CAGGTGACCT 


CATTCCCTCC 


CTTTGTCTCT 


TAACTGGTAT 


CCAGTGAGTT 


540 


GCTGATGCCG 


ACTGGCAAGC 


AGAGCTGGCC 


CATGCCTAGA 


GGAGGAGAGG 


GACAGGCTTT 


600 


CTGAGTGCTT 


TGATTCTGGC 


TGGGCTGTAG 


TGTATAGCTG 


AGAATGGCCT 


CCTGCATGGG 


660 


TCCAGCCTTT 


TGGGAGGATG 


AGGAAGAGGA 


GGCTGAGGCT AGATGTGTGT 


TAAACAGGAG 


720 


TAGCAGATCT 


TTGCTTCTGA 


ACCGGATAGA 


GCCGTGCTCA 


AGGCTTTCAG 


GTTTGCAGTA 


780 


TCTTCTTATT 


GAGTCTGGGG 


AACGTTCCCT 


ATTGAGCCAG 


CCCTGCAGCT 


CCACTCCCTG 


840 


CCCCATTCCC 


CTGATTTCTT 


GACAGTAAAT 


CCTGTGTCCA 


GTTGTGACTG 


TTGGCTGTTT 


900 


TCCTGATCAT 


TGTTAATTCC 


GTTTTCCTAC 


TGTGAGGGTG 


TTGTGTTTGA 


CTTTCTGACA 


960 


CAGCGAATGG 


AGAGCTTGGG 


CTGAGGCCAG 


GTCTCCCAAA 


CTTCTCCGCT 


AAGTAAACCG 


1020 


TAATGTGTGA 


CTTCCCTCCT 


ATCTTGACTG 


CCCGACTCCT 


TCCTTCCACA 


TGGACACTCA 


1080 
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P ATTTP A A Af3 


AAA ACCCdC A 


GGAGGTGGGT 

OOAww X w<J<J X 


PAGTTTPPTP 

w AO XXX Ww X w 


TGTAGAATAT 

X w X «W3i**V X «» X 


GGTPPPAGAG 

ww X WWW Aw Aw 


11*10 

X JL IB w 


ahtpptpppp 


TPTf2TTf2Tf2T 

iWlUl X w X O X 


PTAGPTGPTA 

W X AOW X OW X *» 

1 


GTPTGGGTGT 

w X W X OOO X V? X 


TGTAPCATCA 

X w X <»W W J* X WA 


GTGCAGGRAG 


1200 

At W W 


TPPAPGGPAA 

1 W W AWwVJ wAA 


PACTGAGTTA 

W**W X W*»W X X «» 


TACAGGGACC 

X AW AwwWX^W W 


ACTACCATCG 

AW X nvwo X W w 


CCAGGCTTCC 

WW** Vjw W X X WW 


TAGCAGGCTG 

X A\JvAwwW X W 


1260 

ab At W W 

t 


TTPTTAG TTP 
1 XUX w Aw x x w. 


ATCTTTCAGG 

C\ X W A X X W*»W\J 


TCCGTGTGTA 

x www a w a wr x n 


CTCTGTGCTA 

W X W X W X WW X ** 


GCAGCAGCCT 

w^wr*wwnws#w> X 


TGGTGTTCAA 


1320 


GfiPP AGG ATT 


TGPAGTTTTG 

X w Viiw X X X X %9 


ATAAGCTACA 

1* A **i»WV X **W4* 


GCCCACTTGG 

WWWW«*W X X WW 


AGATGGTGAG 


GCCACTTCCC 

VJ W W4*W X XWWW 


1380 


TGPTAPPTCC 
X v*w x nv>u x w w 


CTPTCTGGTG 

W ib V* X W A WW A W 


TGGCCACTTG 


GCCAAAGCTC 

W W W ****** W W X w 


CCATCCACAG 


CTGGGGTGTC 

W X W WWW X W X w 


1440 

«fc ^ ^ w 

i 


TfiAfiPPAPAG 


PPTGCTAPAC 

WW X WW X /iwlV 


TGTTTPPGPT 

X O X X X WW WW X 


PTGGG ATP TT 

W X VJwv/» X W X X 


PTGAGTTACT 

W X UAw X X *»W X 


PTTTTAGGPT 

W X X X X **w W W X 


1500 


fiTfSftlTTTfifST 
O X ww 111 ww X 


GAAAGGAApp 
waaao unnv w 


AAPAPATTAA 
nnv.nuA x x aa 


PGATPTPPPP 

w OA X W X W w w w 


PPPPPPPPPP 
WWwwwWWWWW 


AAAGPPAPTn 

AAA ww W AW X w 


1560 


/ivj X aa x x v. 1 1 


www A X W X ,X X X 

i ' 


PTAAfiTfiiPTfi 
w X nno X ww x o 


TTW2PTnPPT 
X X www X Ow w X 

i 


VJvJ XXX wVJAU 


p a fSTTnnvsn a 

w Aw X X V? X Urtjf A 


i 


fiAfifltAPAGAP 


TT AGT AC P AC 
x x Aw x nuwnv 


APAGAGGPAT 

AW AwAOOw A X 


GGGTGTGPGA 

O OO X O X O WOA 


GPTGPTPAPA 

Uw X X Wjltwft 


f2PTTPPTTRT 

WW X X WW; X X w X 


1680 

<x u u u 


TTPTCC2P ATf2 


i 

Tf2TPPTtTGf2T 

XUX w w X wOw X 


GPGTfiSPTATA 

wW O X OW X A X A 


LV« X wwwvjwww 


PAfiPTfiPAPA 

wAV3w XUwnwA 


PPT A TfiT APT 

ww X A X vS X aw X 


1 740 


PTHPPPPTGA 


GAGGTAGAfiT 
vjavw x AwAw x 


GTTPAGGAAra 

W X X WAOOAAO 


i 

TGTPPPTGAT 

X \3 X WWW X \Jr\ X 


AGATTPPPTG 

ry\9f\ x X www x V7 


TTPTPTf2TGf3 

X XwXwXwX WW 


1B00 

X u w w 


PPPPAGTTGG 

WWWWAw X X wO 


> 

G PTTTTGGPT 

WW XXX X WOW X 

't 


CCTCPAGPAA 

WW X W WAOWAA 


CTTPATGPPP 

wX X w A X w W W w 


PAAPPPTAPP 

W/WWWW X AW w 

t 


TTTPPTPPPA 

X X X ww X Www A 


I860 

JL S3 U V 


TTTPTPTPTfJ 


tptgptJ^gtp 

X w X O w X Aw X w 


PPTGAAGPPT 

w w X OAAO w w X 


TTAAPPAAAT 

X X nAU w/l/l/\ X 


GGGAATGGGT 

X www X 


APAGAAARPP 

Aw AUAAAU w W 


1920 


TTPTPPTGCJP 


AAPTTAGGGP 

Anv X X AOww W 


AAPAGGAPAA 


nnnPTATTfVP 

wwwW XtXX XwX 


GTGTPPAAPT 

w X w X W W AX\W X 


PTRTTRGTPT , 

W X w X X ww X w X 


1980 

JL >7 W w 


PPTPPTfiPTfi 
ww iuuXuL X w 


AfiAfifiTfSTfiT 

AV3AV30 X W X W X 


PPAAfiPAfiAfS 
w Lnnu w Aonu 


TTfiATPAfiPP 
X Xw/iXw/iOww 


PPTfiPPTttPP 
ww X www X www 


W X W W AWWWW X 


r 

5040 


riA Af^pp ancrz 

waaowwaooo 


fSfJAflOTTART 
NiunVju x x nu X 

1 


ACJAfiAfiTAPP 
AuAOA\3 1 Aw w 


PTA A ATf2nnf2 

W lAAAlVSUUU 


Tf3Afif2f3TPPT 

X \jtv3\3\J X ww 1 


TAAAPATPPA 

X AAAwAX WWA 


«xuu 


aw l waouao w 


wow 1 w 1 X o X o 


TC(2PTf2P A A f2 
i WWU X OwAAw 


fiTV; A^2 A PPPT 


^2TPTPPAf3PA 


PPTfif2C!TAfif2 

WW X \3\3\3 X AW W 


91 fin 


ppp 7a rpnr^rjnn 

ww WA X W OOOO 


A TPTflCPTf^P 


A AfiflTP. AfipfP 
aaoo 1 onyu 1 


w X wnL x x our 


P A TP A OTHP A 

L-iH. X W AO X V7w A 


CI AfJPf3 ATP A A 

OA ww w A X W AA 




GTCACCTGGC 


CTCAGCAGGC 


AGCAGCCTTG 


GTGCCCACCC 


GTGCCCTGCG 


CAGTATTTAT 


22 80 


TGCTAAATTA 


TTGTCCAGGA 


GGGGCGGCAC 


TGGGCCTGGC 


CCCCCGGGTA 


TTTATTGCTG 


2340 


TACATAGTGT 


ATGTTTGTGA 


TATATAAGGT 


TTTCTTTATT 


TTGTATATGA 


TCAATAAACG 


2400 


CCTTTTAAAG 


AAAAA 










2415 


(2) INFORMATION FOR SEQ ID NO; 21 


• 
• 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 3531 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW036 
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<ix) FEATURE: 

(A) NAME /KEY: misc_feature 

(B) LOCATION: complement (2541.. 2575) 
(D) OTHER INFORMATION: /label* SAC.24406 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



GTCGACTGAA 


AACCACTCAG 


AATTTTCTCT 


GGTCCTTAAC 


AGAAAAGAAT 


CAAAGTAGGA 


60 


AGCACAGTAG 


CTGCCACATC 


TGTTCAACCT 


CTGGGCCTCC 


CAAGTACACC 


AGAGAAAACA 


120 


GATCAATGCT 


GTTTCTGAAT 


GATTCTGCTG 


GTTAAAGTCA 


TTTATAGAAG 


ATTCACGATA 


180 


CTGAACAACT 


ATGGGTTAAG 


AACTTACTGT 


GTTTTGTGAA 


TTGCCCAATG 


GAGAGAGTGT 


240 


GTTCAAGGAT GACTCTTGAT CAGATTACAC AGTATGTGCA TCCAGGAGGC CACATGAAAT 

i 


300 


CAGCATAGTT 


AGTAACATAG 


AATCTGTTGT 


CAGCCTCCTC 


CAACTCGTGT 


CTGTTTCTTG 


360 


GTTTCAATTT 


TAGGCCAGCC 


CTCTCCTCAT 


AAAGGGGTGA 


TAGTAGCCAC 


AGTCCACAGC 


420 


CTCAAGGTCT 


GGTCTCCAAA 


AAGAAGAGTG 


TGAGCTGGGC 


c:acaatcata 


CCCTCCTGAA 


480 


GAACTCTGGT 


CTTTTCTGGT 


CATATGCTCC 


TGCCCCACCC 


TTGGTGATCC 


TGGCTAGATC 


540 


ATGGTCTCCC 


GTACTGTGCC 


GACACCATTC 


ACAGTAACAT 


AGGATGAGTT 


ATGGGCCTCA 


600 


TCCCTAAACA 


CTGGAACTTG 


CAGCAAATGA 


TGTAGCAGAT 

i 


&TTCACTCTA 


GATAACAAAT 


660 


GTCAGTGACA TGAAGTGGCT AGGAAAGTTT AGAACCCTGA TCATTTTATT TTTACCACTT 


720 


AACACAGTAG 


AGAGCCATCA 


GGACTAAATC 


TTTGGTGTGA 


TTCTTGCCAT 


TGAAGTCATT 


780 


ATAGCTTCCA 


TATTGTTCCG 


TAGTTCACTG 


GAAAGTGCAC 


TTGCTGTTAA 


AAGTTTTGAA 


840 


GCTGTGAGGA 


TTCCCTAGGG 


TGTCACTGAA 


GTCTGCTCTG 


TATACTGCCC 


CCACTGGGAG 


900 


TGCACGGTAC 


TGCACCCTTA 


AATCCTGCCG 


CCTGTGTCCC 


AAACATGGTG 


TCTGCGTGCA 


960 


GTGTGGAGTA 


GTGCGCCTTC 


TTTTTTTTTT 


TTTTTI'TTTT 


CCTATTCTTT 


TTTTGGAGCT 


1020 


GGGAACCGAA 


CCCAGGGCCT 


TGTGCTTGCT 


AGGCAAGTGC 


TCTACCACTG 


AGCCAAATCC 


1080 


CCAACCCCTG 


TAGTGCGCCT 


TCTATACTAG 


AAAGCTTGAC 


CACTGAGCCA 


CACCTCCCAC 


1140 


TAGTGCTTCA 


ATGTCAACCG 


AGAGTAAAAT 


GTGTTTGTAT 


GAAATGCCTC 


CATTTGACTA 


1200 


GATAGAGCTT 


TATTTGGAGA AAGTCACATA 


TAACATAATT 


GAACTTTGAA 


TTATACAATC 


1260 


CCGTGGATTT 


TAGAGTGCTC 


CTGGAGCAGG 


TGGCAGTCAC 


CACTATCTAC 


TTCCAGAACA 


1320 


GTCTCATCCT 


TTCCAGAAAC 


CCACACTCTG 


TCTTTCCTCT 


ATTCCAGATC 


TGTTAGACGA 


1380 


GTGGAATTAC 


ATAGTCCGGT 


CTTTTCTGAG 


TTCTGTTACT AAGTTTTAAA 


GGTTTATTCT 


1440 


CAGGTAGCAT 


CAGTCCGTAA 


TGTATTACTG 


CTGAATAGTG 


TTCCGTGTAT 


ACAGACACCG 


1500 


TGTGTGTCTT 


CTTCCAGCGA 


GCAGAGGAAC 


TCTGAGCTGT 


TTCTACTTTG 


GGGCTTTTGA 


1560 


CTAATGCTAT 


GAACATCTGT 


GAAAAAGTTC 


GAAATGTTTG 


ATTTAGTACA 


GACCCTAGTG 


1620 


GGGAGCTCCG 


GGGTCATATT 


ATGACAGCCT 


. CAATTGTACT 


TCCTACAGTG 


GTTTTACCAC 


1680 
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CATTTCCTGC 


TCTCGTGGGA 


TCTAGGCTCC 


AGCATCCCTC 


ACAACTTTCT 


GCCTGAGATG 


1740 


AAGAGGCATC 


TGATTGGGAT 


CTTGGTTTGC 

i 


ATTTCCCTAA 


TGTCTAATAA 


TCTGAGCTTT 


1800 


TTTTCATGTG 


TTCATTGGCT 


TTCTATGCTG 


< * 

CTTTGCAGAA 


TGTTTATTTC 


AGGCTACAGT 


1860, 


CTGCCTTTCA 


GCTGGGTTAT 


CTTTCTGTTT 


TTCTGTAGGA 


TTTTTTATTT 


ACGGTCAACT 


1920 


CATCTCTTAA 


AGATTAATTG 


GCATTTTTTT 


TTTTCTCAAC 


TTGTGGTTTG 


CCCTTCTCAA 


1980 


TGTCTTGATG 


CTCTCTGAAG 


CACAGTTTTT 


ATTTTTATAT 


ATTGCCTTTC 


TGCGTCTTAG 


2040 

i 


CCCAGGCTGG 


CCAGGAATCC 


CCAAAGCCTT 


GTACACAGAG 


TGAGCATTTT 


CACCGCCTCA 


2100 


CTTCCCAGAA 


GGCACACGGC 


GCTGCTCCTG 


CCTAGAGCAG 


TTTTCATATT 


GGAAAATAAC 


2160 


AATGTGTTTT 


TTCTTTGAGT 


GTATCCTAAG 


AAACCATTCC 


TAACCTGGAC 


TTACAGAAGC 


2220 


TTATGTCTTC 


ACTAAAGAAT 


TTTTAAATTT 


TACCTCTTAC 


CTGTAGGCCT 


GTGCTCTCAA 


2280 


AATGTTACAC ACCTTTATTC ATGGCCATAT TTAAAATTAA GTAGAAATAG TGAGTATGTA 


2340 


GATATGCCTG 


TATCGTGATG 


AGCACATTCA 


CAGTGCAGTG 


CAGTTGGCCT 


GCAGCCCAGT 


2400 


TCTGCATCCA CCTCAGAaIaA TGCCCAGTGA GCC^ATCTAA TGACTCCCAG CCCCTGCCAG 


2460 


GTCCTCCCTC 


CCTGCdcTGT 


CTTGGCATAC 


TCTCTTCTCC 


ATGGCAATCA 


CCATTCTCTG 


2520 


TCCTGTTTCT 


GAATTTGAGT 


ACCCTAAGTG 


CCTGGTATAC 


ATAGATTCTC 


AGTGCGTGTG - 


2580 


TGTGTGTGTG 


TGTGTGTGTG 


TGTGCGTGCG 


TGTTCATGTA 


GCAGAATGCC 


GTGAAGGCTT 


2640 


GTATCATGTA 


TGAGTTTCCT 

i 


TTGTAAGGAC 


AAATAGTACT 


TTAGACTTTA 


GTGTGTGTTG 


2700 


TATGTATGCT 


GTATGTTTCT 


TGTTACCTAC 


TCATCCAGCC 


GTGGGTACCT 


CAGTTGTTCC 


2760 


CACCCTGGAG 


CTCCTGACTA 


TGCTGTGAAC 


ATGTATATAC 


AAATATCCCT 


CGGATCCTGC 


2820 


TTAGCATACA 


CGCAGGAGAT 


TGTATGTCTT 


AGGGTAACTC 


TGCTGTTATT 


AGTAGGCACT 


2680 


GCCGTAGTCT 


CCATAGTGAC 


CACACTATTG 


TATGTCCTGG 


CAGCAGTGCA 


CAGAGATTAG 


2940 


AAGTGCCCAT 


GTCCTTGTGA 


GTCTTACTCT 


TTTTTTAGTG 


GCTGTTCTAG 


TAGGTGTTAA 


3000 


GATCTGCTAG 


TGCTTCTGCT 


TCCCATCTCT 


CTGACTACTG 


ATGTCACATG 


TTTTATATGC 


3060 


TTGGTGGCCT 


TCTTTATATA 


TCGTTTGAAG 


AAATCTCTTT 


AGGTCCACTG 


CCCATTAATA 


3120 


GTTGGGTCGT 


TGGCTCTGGT 


TGAATTTTTG 


GTAGTCACAT 


CCACAAAAGC 


ATTAACCCAC 


3180 


CCAAGGGAGC 


CTACTACAGC 


TTGCCCTAGG AAACTTGATT 


TTAGCTCTTA 


GCTTTGCTCT 


3240 


CTGGTATATT 


TTGAGTTCAT 


TTGTGTGTGA 


GATGTAAGGT 


CTGACTTTCT 


ACTCTACCCA 


3300 


TGTGGATACC 


TGGCTTTGCA 


TGTTGCCCTT 


GGTGGGAAAA 


CATCCCGCTG 


TGCATGGTGG 


3360 


GGCTGTTTCT 


GAGCCTGGTC 


TCTTCCATGA 


CCTCTGTCTG 


CCTTCATGCC 


AACGCCACTG 


3420 


CTTTGATTAC 


TGGAGAATTA 


TAATAGAAAT 


TTGAAATCAG 


AAAATGTTTG 


TTCTGTAATA 


3480 


AAAATTTGAA ATCAGAAAGT 


GTTCTATAAT 


AAAAATTTGA 


AATCAGAAAA 


A 


3531 
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(2) INFORMATION FOR SEQ ID NO; 22; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2127 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW037 

i 

(ix) FEATURE: 

(A) NAME /KEY: CDS 1 

(B) LOCATION: 280.. 1422 

{ix> FEATURE: 

(A) NAME /KEY: misc.feature 

(B) LOCATION: 1370.. 1796 1 

, (D) OTHER INFORMATION ; /label- SAC_24354 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GTCGACCTGC GGCTGGCCGG CGGGGCAGGC CCCGGACCCG ( TGCGGTTCCT GGGCATGGTG 60 

AGCAAGGGGC TGCTGCGCCT TGTCTCTTCA GTCAACCGCA GGAAGATQAA GCTTCTGCTG 120 

GGCATCGCGC TGTTCGCCTA CGCCGCCTAA CTTGCTCATG ATGGAGTGCC AAGTGCCTCA 180 

GAGAAACAGG CCACCAGAAG AACAGTGCAT TCTCAGTACA AGGGCAGCGT TCAGCATTTG 240 

GATATTGGAG TTGAAGCTGT TTGGGGCAAC TTTGTTAAC ATG AGG TCT ATC CAG 294 

Met Arg Ser lie Gin 
1 5 

GAA AAT GGT GAA CTA AAG ATT GAA AGC AAG ATT GAA GAG ATT ATT GAA 342 
Glu Asn Gly Glu Leu Lys lie Glu Ser Lys He Glu Glu He He Glu 

10 15 20 

CCA TTA AGA GAG AAA ATC AGA GAT TTG GAA AAA AGT TTC ACC CAG AAA 390 
Pro Leu Arg Glu Lys He Arg Asp Leu Glu Lys Ser Phe Thr Gin Lys 

25 30 35 

TAC CCA CCA GTA AAG TTT TTG TCA GAA AAG GAC CGG AAG AGG ATT TTG 438 
Tyr Pro Pro Val Lys Phe Leu Ser Glu Lys Asp Arg Lys Arg He Leu 
40 45 50 

ATC ACT GGA GGT GCG GGC TTT GTG GGC TCC CAT CTA ACT GAC AAA CTC 486 
He Thr Gly Gly Ala Gly Phe Val Gly Ser His Leu Thr Asp Lys Leu 
55 60 65 

ATG ATG GAT GGC CAT GAG GTG ACC GTG GTG GAC AAC TTC TTC ACA GGC 534 
Met Met Asp Gly His Glu Val Thr Val Val Asp Asn Phe Phe Thr Gly 
70 75 80 85 

AGG AAG AGA AAT GTG GAA CAC TGG ATT GGC CAT GAG AAC TTC GAG CTG 582 
Arg Lys Arg Asn Val Glu His Trp lie Gly His Glu Asn Phe Glu Leu 

90 95 100 



-72- 



SUBSTITUTE SHEET (RULE 26) 



WO 98/53071 



PCT/US98/10547 



ATT AAC CAT GAT GTG GTA GAG CCA CTC TAC ATC GAA GTT GAC CAG ATC 630 
lie Asn His Asp Val Val Glu Pro Leu Tyr He Glu Val Asp Gin He 

105 110 115 

TAC CAT CTG GCT TCT CCA GCC TCC CCT CCA AAC TAC ATG TAC AAC CCC 678 
Tyr His Leu Ala Ser Pro Ala Ser Pro Pro Asn Tyr Met Tyr Asn Pro 
120 125 130 

ATC AAG ACC CTG AAG ACA AAT ACA ATT GGA ACA CTA AAC ATG TTG GGA 726 
He Lys Thr Leu Lys Thr Aen Thr He Gly Thr Leu Asn Met Leu Gly 
135 140 145 

CTG GCA AAG CGT GTG GGT GCC CGT CTG CTC CTA GCC TCC ACA TCT GAG 774 
Leu Ala Lys Arg Val Gly Ala Arg Leu Leu Leu Ala Ser Thr Ser Glu 
150 155 160 ' 165 

GTA TAT GGA GAT CCT GAG GTC CAC CCT CAA AGT GAG GAC TAC TGG GGC 822 
Val Tyr Gly Asp Pro Glu Val His Pro Gin Ser Glu Asp Tyr Trp Gly 

170 175 180 

CAC GTG AAT CCC ATA GGA CCC CGA GCC TGC TAT GAC GAG GGC AAA CGT 870 
His Val Asn Pro He Gly Pro Arg Ala Cys Tyr Asp Glu Gly Lys Arg 

185 190 195 

GTT GCG GAA ACC ATG TGC TAT GCC TAC ATG AAG CAG GAA GGT GTG GAG 918 
Val Ala Glu Thr Met Cys Tyr Ala Tyr Met Lys Gin Glu Gly Val Glu 
200 205 ( 210 

GTG CGG GTG GCC AGG ATC TTC AAC ACC TTT GGG CCA CGA ATG CAC ATG 966 
Val Arg Val Ala Arg lie Phe Asn Thr Phe Gly Pro Arg Met His Met 
215 220 225 

AAT GAT GGG AGG GTG GTC AGC AAC TTC ATC TTA CAA GCG CTA CAA GGG 1014 
Asn Asp Gly Arg Val Val Ser Asn Phe He Leu Gin Ala Leu Gin Gly 
230 235 240 245 

GAG CCG CTC ACA GTA TAT GGA TCT GGG TCT CAG ACA CGG GCA TTC CAG 1062 
Glu Pro Leu Thr Val Tyr Gly Ser Gly Ser Gin Thr Arg Ala Phe Gin 

250 25E 260 

TAT GTT AGC GAT CTA GTG AAT GGC CTG GTA GCA CTG ATG AAC AGC AAT 1110 
Tyr Val Ser Asp Leu Val Asn Gly Leu Val Ala Leu Met Asn Ser Asn 

265 270 275 

GTC AGC AGC CCT GTC AAC CTG GGA AAT CCA GAA GAA CAC ACA ATC CTG 1158 
Val Ser Ser Pro Val Asn Leu Gly Asn Pro Glu Glu His Thr He Leu 
280 285 290 

GAA TTT GCT CAG TTA ATT AAA AAC CTT GTT GGT AGT GGA AGT GAA ATT 1206 
Glu Phe Ala Gin Leu He Lys Asn Leu Val Gly Ser Gly Ser Glu lie 
295 300 305 

CAG TTT CTC TCT GAA GCT CAG GAT GAT CCA CAG AAA AGA AAA CCA GAC 1254 
Gin Phe Leu Ser Glu Ala Gin Asp Asp Pro Gin Lys Arg Lys Pro Asp 
310 315 320 325 

ATC AAA AAA GCA AAA CTG ATG CTG GGG TGG GAG CCT GTG GTT CCA TTG 1302 
He Lys Lys Ala Lys Leu Met Leu Gly Trp Glu Pro Val Val Pro Leu 

330 335 340 

GAG GAA GGA TTG AAC AAA GCC ATC CAC TAT TTC CGG AAG GAA CTA GAG 1350 
Glu Glu Gly Leu Asn Lys Ala He His Tyr Phe Arg Lys Glu Leu Glu 
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345 350 355 

TAG CAG GCT AAT AAC CAG TAC ATC CCT AAA CCC AAG CCT GCC AGA GTG 1398 
Tyr Gin Ala Asn Asn Gin Tyr lie Pro Lys Pro Lys Pro Ala Arg Val , 
360 365 1 370 



AAG AAG GGC CGG ACG CGC CAC AGC TGAGTTAGCC TTGGGATGCG AGACTCTATT 1452 
Lys Lys Gly Arg Thr Arg His Ser 



j / j 




JOU 










X X AL> X 1 X A X A 


AV3V3 X OwAw X 1 




fTMTi rn/iiinnwri rnrnrn 
XXXXXXXXXX 


X X X X AAV9 A^# X 


T A A & P A f2 (VWl 


icn 


TPAffiAAHAA 


V- AnAV~ X UOnn 


X X X Xf\X 1 


i 

nnuu X X \9V» X X 


TA AAflAPAPf 
X AAAtvA\« A\- X 


M ATYSTfiPPT 1 A 
Oft X \ar 1 VJUU X A 


13 / « 


A A AftPTPPPT 1 


i 




i 

x x \a\0 n\r l X x x 


m* apfipTVJTP 


ifu furif riiiif 1 1 tv #tv2P 
11111 lAlvav. 


10 J* 


2l pa apa/^ppT* 

AV»AAL.AuCV» 1 


A ^ A 1Y2P A T^PP 

1 i ■ 




T*P & rt^2' IMI Kill III II 


1 1A1L. 1 ioV-i 


O 1 1 AVsAO 1 W 1 




ATGCTGTAAC 


TGTCACTGAC 


AGTTTTATTT 


ACTGGTTTCT 


TTGTGAAGCT 


GAAAAGAAAC 


1752 


ATTAAATGGG 


GTGGAAAATG 


CCAATTTTAT 


TTATAAAAGT 


GAGTACTTTA 


TAAATGAGAT 


1812 


GTTACACTAT 


GCATAAAGAA 


TACAAACCTA 


1 . i 
GAGGTATGGC 


CAGCTGGGCA 


GTGCACCAGT 


1872 


GTTTATTTGG 


GGACAGATGA 


AAGAACTCCA 


TTGGAAAGCT 


TTGCATTTCT 


TTTTTAATTC 


1932 


TGAATTTTCT 


GAAGGTCTAG 


TTTTCAGTTA 


CAGACTTGAC 


TTTGAAACAT 


CCCTGTTGGT 


1992 


TCTTGATCAA 


AGATATTTGA 


AATCACTACT 


GTGTTGTGCT 


GCATATTGGG 


GTGGGGGTGG 


2052 


GGACAATGTT 


AACATATTCT 


TGGTTAACCA 


TGGTTAAATA 


TGCTATTTTA 


ATAAAATATT ' 


2112 


GAAACTCGTC 


AAAAA 










2127 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 amino acids 

(E) TYPE: amino acid 
(D) TOPOLOGY: lineax 

(ii) MOLECULE TYPE: protein . 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Met Arg Ser lie Gin Glu Asn Gly Glu Leu Lys He Glu Ser Lys He 
15 10 15 

Glu Glu He He Glu Pro Leu Arg Glu Lys He Arg Asp Leu Glu Lys 

20 25 30 

Ser Phe Thr Gin Lys Tyr Pro Pro Val Lys Phe Leu Ser Glu Lys Asp 
35 40 45 

Arg Lys Arg He Leu He Thr Gly Gly Ala Gly Phe Val Gly Ser His 
50 55 60 

Leu Thr Asp Lys Leu Met Met Asp Gly His Glu Val Thr Val Val Asp 
65 70 75 80 

Asn Phe Phe Thr Gly Arg Lys Arg Asn Val Glu His Trp He Gly His 

85 90 95 
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Glu Asn Phe Glu Leu lie Asn His Asp Val Val Glu Pro Leu Tyr lie 

100 105 110 

Glu Val Asp Gin lie Tyr His Leu Ala Ser Pro Ala Ser Pro Pro Asn 
115 120 125 

Tyr Met Tyr Asn Pro He Lys Thr Leu Lys Thr Asn Thr He Gly Thr 
130 135 140 

Leu Asn Met Leu Gly Leu Ala Lys Arg Val Gly Ala Arg Leu Leu Leu 
145 150 155 160 

Ala Ser Thr Ser Glu Val Tyr Gly Asp Pro Glu Val His Pro Gin Ser 

165 1 170 175 

Glu Asp Tyr Trp ( G^.y His Val Asn Pro He Gly Pro. Arg Ala Cys Tyr 

180, » ' 185 190 

Asp Glu Gly Lys Arg Val Ala Glu , Thr Met Cys Tyr Ala Tyr Met Lys 
195 . 200 205 



• i 



Gin Glu Gly Val Glu Val Arg Val Ala Arg He Phe Asn Thr Phe Gly 
210 215 220 

Pro Arg Met His Met' Asn Asp Gly Arg Val Val Ser Asn Phe He Leu 
225 ' ( 230 235 240 

Gin Ala Leu Gin Gly Glu Pro Leu Thr Val Tyr Gly Ser Gly Ser Gin 

245 • 250 255 

Thr Arg Ala Phe Gin Tyr Val Ser Asp Leu Val Asn Gly Leu Val Ala 

260 265 270 

Leu Met Asn Ser Asn Val Ser Ser Pro Val Asn Leu Gly Asn Pro Glu 
275 280 285 

Glu His Thr He Leu Glu Phe Ala Gin Leu He Lys Asn Leu Val Gly 
290 295 300 

Ser Gly Ser Glu He Gin Phe Leu Ser Glu Ala Gin Asp Asp Pro Gin 
305 310 315 320 

Lys Arg Lys Pro Asp He Lys Lys Ala Lys Leu Met Leu Gly Trp Glu 

325 330 335 

Pro Val Val Pro Leu Glu Glu Gly Leu Asn Lys Ala lie His Tyr Phe 

340 345 350 

Arg Lys Glu Leu Glu Tyr Gin Ala Asn Asn Gin Tyr lie Pro Lys Pro 
355 360 365 

Lys Pro Ala Arg Val Lys Lys Gly Arg Thr Arg His Ser 
370 375 380 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1785 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vii) IMMEDIATE SOURCE: 
<B) CLONE: HW038 

(ix) FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATION: 1440.. 1714 

(D) OTHER INFORMATION: /label* SAC 24332 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 



GTCGACGGTT 


GAGTCTGATG 


AGGAAGATCA 


AGAAAAGACT 


GAGGAGACTC 


CTTCCCAGCT 


60 


GGACCAGGAT 


ACTCAAGTAC 


AGGACATGGA 


TGAGGGTTCA 


GACGATGAAG 


AAGAAGGACA 


120 


GAAAGTGCCT 


CCACCCCCAG 


AGACACCCAT 


GCCTCCACCT 


tTGCCCCCAA 


CACCAGACCA 


180 


AGTCATTGTG 


CGAAAGGACT 


ATGATCCCAA 


AGCTTCCAAG 


CCCCTTCCTC 


CAGCTCCTGC 


240 


TCCTGATGAG 


TATCTTGTGT 


CCCCCATAAC 


TGGGGAGAAG 


ATTCCTGCCA 


GCAAAATGCA 


300 


AGAACACATG 


CGCATCGGGC 


TTCTTGACCC 


CCGCTGGCTG 

i 


GAGCAACGGG 


ATCGCTCCAT 


360 


CCGAGAGAAG 


CAGAGCANTG 


ATGAGGTGTA 


CGCACCAGGT 


CTGGAITAI^TG 


AGAGCAGCCT 


420 


AAAGCAGTTG 


GCTGAGCGAC 


GTACTGACAT 


CTTTGGGGTA 


GAAGAGACAG 


CCATTGGTAA 


480 


GAAGATTGGT 


GAAGAGGAGA 


TCCAGAAGCC 


AGAGGAAAAG 


GTGACTTGGG 


ATGGCCATTC 


540 


AGGCAGCATG 


GCCCGGACCC 


AGCAGGCTGC 


CCAGGCCAAC 


ATCACCTCCA 


GGAGCAGATT 


600 


GAGGCCATCC 


ATAAGGCCAA 


GGGCCTGGTG 


CCAGAAGATG 


ACACCAAAGA 


GAAGATTGGC 


660 


CCTAGCAAGC 


CCAATGAAAT 


CCCTCAGCAG 


CCACCGCTCC 


ATCTTCAGCC 


ACCAATATCC 


720 


CCAGCTCAGC 


GCCCCCCATC 


ACGTCATTCC 


CCGGCCACCT 


GCGATGCCAC 


CTCCAGTCCG 


780 


CACCACTGTC 


GTATCTGCAG 


TGCCTGTCAT 


GCCTCGGCCC 


CCCATGGCAT 


CTGTGGTCCG 


840 


GCTGCCCCCA 


GGCTCAGTGA 


TTGCCCCGAT 


GCCACCCATC 


ATCCACGCAC 


CTAGGATCAA 


900 


CGTGGTACCC 


ATGCCTCCTG 


CAGCACCNCC 


CATCATGGCA 


CCCCGNCCAC 


CTCCCATGAT 


960 


TGTGCCAACA 


GCCTTTGTGC 


CTGCTCCCCC 


CTGTGGCTCC 


AGTGCCAGCC 


CCAGCTCCAA 


1020 


TGCCTCCCGT 


ACACCCCCCA 


CCTCCTATGG 


AAGATGAGCC 


TCCATCCAAG 


AAACTAAAGA 


1080 


CCGAGGACAG 


CCTCATGCCT 


GAGGAAGAGT 


TCCTGCGCAG 


GAACAAGGGT 


CCAGTATCCA 


1140 


TCAAGGTGCA 


GGTACCCAAC 


ATGCAGGATA 


AGACAGAATG 


GAAGCTGAAT 


GGGCAGGGGC 


1200 


TGGTCTTCAC 


TCTCCCGCTC 


ACAGACCAGG 


TCTCTGTCAT 


CAAAGTGAAG 


ATTCATGAAG 


1260 


CNACGGGCAT 


GCCTGCAGGG 


AAGCAGAAAC 


TACAGTATGA 


AGGCATCTTC 


ATCAAGGACT 


1320 


CCAACTCCCT 


GGCTTACTAC 


AACATGGCCA 


GTGGTGCTGT 


CATCCACTTG 


GCCCTCAAGG 


1380 
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AAAGAGGCGG CCGGAAGAAG TAGCCTCGAG AGATACTGCA AGTGGACTCC TGCCATTGTA 1440 

CCTCTTTCCA CCACCAATTC CGGGAGTTTC CCTGGGACCT CGGGCACAGC CTGGGCATTT 1500 

GTCTCCCTCT AAGGCAGTCT ATGGAATGGA TATCTGTTTA GATTTGGATT GGAGGTCCTT 1560 

CAAGGAAGCC CATTAGGAAT TGAAAACCCA GGGTTCTCCC ACAGGAGAGC TTCATAATAC 1620 

CAAGTAATGT TTGGCTAATA GAGTATGGTT ACTACCATTC ATCATTGTGT ATCATGCTTT 1680 

GGCCTCTAAT AAACCTCTTG AAGCCAGGCA TGGTACACAC ATGCCTGTAA TCCCGGGACT 1740 

TGGGGGGCGG GCAGAAGAAT TAAATGTTCA TCTTGGCTGT AAAAA 1785 
(2) INFORMATION FOR SEQ ID NO: 25: ' 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1054 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear ( 

(ii) , MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO i 
(iv) ANTI-SENSE: NO 



i 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW039 



(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION: 231.. 617 

<D) OTHER INFORMATION: /label= SAC_24045 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 



TTTTTCAGAA 


TTGGCTGACT 


TTAATTTGGA 


AACAGCTTTT 


GGCTACAGAC 


ATTCAGGTTT 


60 


AAGGCACATT 


CCAAACTACA 


AGGATCCCTG 


CCACTGGCAC 


GGGGCCTGGC 


TTTAATTCTT 


120 


GCATGGTTTT 


ACATGGAAAG 


TGGAACAAAT 


GATTTTCTGG 


CCTCTTCTAA 


AAAAGAGTGT 


180 


GAGTTTGGCA 


AAAACAAAAC 


TCATGACACA 


TTCTCACCAG 


ACACCACAGT 


ACCGTTACAA 


240 


GACCTCCCTT 


TTGTCCTAGG 


CAAAGACGGG 


TGGCCCGCTC 


ACCTACCATT 


CTCTCCAGAA 


300 


ACATGTATAC 


AGTAAGCATG 


CTTGTATGCG 


TCTGTCTGGC 


AGCGATGGTT 


AAAAAAAGTT 


360 


TTCAAATAAA 


AATACAAAAA 


TCACACGTCT 


ATAACAAAGC 


TAAGAGTCTT 


AGCCACAAGC 


420 


AGCGAGGAAT 


GAGTGTAGGA 


ACCAGGAGAG 


ATAAACATCA 


ATTCTGCACA 


GCAAGGGGCT 


460 


GTGAGGGCCA 


GGGAGATGGT 


CACCTGAGAA 


CCTGGCCCAC 


CTGGGCCCTT 


CGGCACAATA 


540 


CGTGGATGAG 


GGGGGACCAG 


AAGGAAAAGG 


TCATGGCTTT 


GGGGATGGGT 


GGATGGCTGG 


600 


GGGGTGCACT 


TGTGGGTACT 


AGATAGATTG 


GGGGAACTGG 


TGTGCTCTTC 


GGGACGTTTC 


660 
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CTGGCCACTG CCAGCAGGGG GCCAGCAGGG GGCACAGGTG AGCCACTGCC ACCGCTCAGT 720 

CGTTGCCACC ACAGATCTTC AGCAGGATCT TCCGATAATC CCCAGAAGTG TCTCCCGTGA 780 

TATCGTGGTA CAGTGACTTG CCGTACATCC GCTTATACTC TGCTCGGATA TCCAGCAGGT 840 

CAAGCTCGCT TCGAGACACC ATGATGCGAA TCAGGGTCCG GTCTTTTGTT CCTGCTCCCC 900 

CTCATAGCCT TGTTGAGCCT TTCGGCAAAG AAGGCAGGGG TGTTCTTGAG GCATTTCACC 960 

ACAGCCAGCA TGCCCTGCTC CAGGTCCCCA GACATCTCCG GCAGATGCTC TTCTCGATGT 1020 

CTCGGCCTGT CATCCTCTGA TACTCGTTGA AACT 1054 
(2) INFORMATION FOR SEQ 10 NO: 26; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3306 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW040 

\ 

\ 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 481.. 2433 

(ix) FEATURE: 1 

(A) NAME /KEY: misc_feature 

(B) LOCATION: complement (2395.. 2588) 
(D) OTHER INFORMATION: /labels SAC_24520 



ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

GTCGACGCCT GGTGTGGCTG GCGGCACCAG CTGAGTATTC CCTGAAAGAA TGAAGATGAC 60 

TGGCCTAGAG GAGTCAGGAA TGTGACTGCT GGGAGATCCA GCTAGAGCAA CCATACCTGC 120 

TTTTCCAGGA CCTTCCCCCT AACAGCTAAG GAGCCTTAGG TCTGGGAGAG GCAGGCACCT 180 

GGCTGCAGAA GGGGAAACAA CCTTTTTCTC TTTCTCTCCT CCACTACTCT CCTGATGGCT 240 

GCAGAAGAAG AAAGATCTAC CACGGGAAAC TCGGAGAGCT AGTCGGTTAA CTGAGCTGTT 300 

GGATCTCCAC GGCACGAGGC AAATCCGTGA TGTGACTTTT TGTTTTTTCT CCTAGGTCAG 360 

CTGAGGGAAC AGCTCAGTTA CCTTAAGGGC GATAACTTTT TCAGGTTTAC TTGTGCCGAT 420 

TGCTCGGCAG ACGGCAAGGA GCAGTCCGAG AGGCTGAAGC TGACATGGCA GCAAGTGGTC 480 

ATG CTG GCT ATG TAC AAC TTG TCT CTG GAA GGA AGT GGA CGT CAA GGC 528 
Met Leu Ala Met Tyr Asn Leu Ser Leu Glu Gly Ser Gly Arg Gin Gly 
1 5 10 15 

TAC TTC CGG TGG AAA GAA GAT ATC TGT GCT TTT ATT GAG AAA CAT TGG 576 
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Tyr Phe Arg Trp Lys Glu Asp lie Cys Ala Phe lie Glu Lys His Trp 

20 25 30 

ACT TTT TTA CTA GGA AAT AGG AAA AAG ACT TCG ACG TGG TGG AGC ACA 624 
Thr Phe Leu Leu Gly Asn Arg Lys Lys Thr Ser Thr Trp Trp Ser Thr 
35 40 45 

GTA GCA GGT TGC CTC AGC GTG GGA AGT CCT GTT TAT TTC CGT TCA GGT 672 
Val Ala Gly Cys Leu Ser Val Gly Ser Pro Val Tyr Phe Arg Ser Gly 
50 55 60 

GCT CAG GAA TTT GGA GAG CCT GGA TGG TGG AAA CTT GTT CAT AAC AGA 720 
Ala Gin Glu Phe Gly Glu Pro Gly Trp Trp Lys Leu Val His Asn Arg 
65 70 75 80 

CCC CCA ACA ATG AGA CCA GAG GGA GAG AAG CT6 TCT GCC TCC ACT TTG 768 
Pro Pro Thr Met Arg Pro Glu Gly Glu Lys Leu Ser Ala Ser Thr Leu 

85 90 95 

AAA GTG AAA GCT TCA AAA CCA ACA CTG GAT CCC ATC ATT ACC GTT GAA 816 
Lys Val Lys Ala Ser Lys Pro Thr Leu Asp Pro He He Thr Val Glu 

100 105 , HO 

GGA CTC AGA AAA CGA GCA AGC CGG AAT CCT GTG GAA TCA GCC ATG GAA 864 
Gly Leu Arg Lys Arg Ala Ser Arg Asn Pro Val Glu Ser Ala Met Glu 
115 120 il25 

TTG AAA GAG AAG CGG TCT CGA ACA CAG GAA GCT, AAA GAC ATC AGA AGA 912 
Leu Lys Glu Lys Arg Ser Arg Thr Gin Glu Ala' Lys Asp He Arg Arg 
130 135 140 

GCA CAG AAA GAA GCA GCC GGC CTC CTT GAC AGA AGC ACC TCT TCT ACC 960 
Ala Gin Lys Glu Ala Ala Gly Leu Leu Asp Arg Ser Thr Ser Ser Thr 
145 150 155 160 

CCT GTC AAG TTC ATA AGC CGA GGC CGG AGG CCG GAT ATG ATT TTG GAG 1008 
Pro Val Lys Phe He Ser Arg Gly Arg Arg Pro Asp Met He Leu Glu 

165 170 175 

AAA GGA GAA GTG ATT GAC TTT TCA TCC CTG AGC TCC TCG GAC CGT ACC 10 c 6 
Lys Gly Glu Val He Asp Phe Ser Ser Leu Ser Ser Ser Asp Arg Thr 

180 185 190 

CCC CTC ACA AGC CCC TCT CCT TCT CCC TCT CTG GAT TTC TCT GCC CCA 1104 
Pro Leu Thr Ser Pro Ser Pro Ser Pro Ser Leu Asp Phe Ser Ala Pro 
195 200 205 

GGG ACA CCA GCG TCA CAT TCT GCC ACG CCT AGC TTG CTC TCT GAA GCA 1152 
Gly Thr Pro Ala Ser His Ser Ala Thr Pro Ser Leu Leu Ser Glu Ala 
210 215 220 

GAC CTC ATC CCA GAC GTG ATG CCA CCC CAA GCT CTA TTT CAT GAT GAT 1200 
Asp Leu He Pro Asp Val Met Pro Pro Gin Ala Leu Phe His Asp Asp 
225 230 235 240 

GAT GAG CTG GAA GGA GAT GGA GTC ATA GAC CCA GGG ATG GAG TAC ATC 1248 
Asp Glu Leu Glu Gly Asp Gly Val He Asp Pro Gly Met Glu Tyr He 

245 250 255 

CCA CCC CCA GCT GGG TCA GCT TCT GGG CTG ATG GGG AGC AGA AAG AAG 1296 
Pro Pro Pro Ala Gly Ser Ala Ser Gly Leu Met Gly Ser Arg Lys Lys 

260 265 270 
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GTC AGA GCT GCA GAG CAG ATA AAA CAG GAA GTC GAG AGC GAG GAG GAG 1344 
Val Arg Ala Ala Glu Gin lie Lys Gin Glu Val Asp Ser Glu Glu Glu 
275 280 285 

AAG CCA GAC AGG ATG GAT GGA GAT AGC GAA GAC ACA GAT TCA AAC ACT 1392 
Lys Pro Asp Arg Met Asp Gly Asp Ser Glu Asp Thr Asp Ser Asn Thr 
290 295 300 

TCT TTG CAC ACT AGA GCT CGA GAA AAG AGG AAG CCA CCC CTG GAG AAG 1440 
Ser Leu His Thr Arg Ala Arg Glu Lys Arg Lys Pro Pro Leu Glu Lvs 
3 °5 310 , 315 320 

GAC ACG AAG CCC AAG GGG CCC AGG TAT ACG CCT GTG AGC ATC TAT GAG I486 
Asp Thr Lys Pro Lys Gly Pro Arg Tyr Thr Pro Val Ser lie Tyr Glu 

325 i 330 335 

GAG AAG CTG CTG CTC AAG AGG CTG GAG GCA TGC CCT GGT GCC GTG GCC 1536 
Glu Lys Leu Leu Leu Lys Arg Leu Glu Ala Cys Pro Gly Ala Val Ala 

340 345 350 

ATG ACT CCA GAA GCC, CGG AGG CTG AAA CGG AAG CTG ATC GTC CGG CAA 1584 
Met Thr Pro Glu Ala Arg Arg Leu Lys Arg Lys Leu He Val Arg Gin 
35 * 360 365 

GCA AAA AGG GAT CGG GGC TTA CCA CTG TTT GAC TTG GAC GAG GTT GTG 1632 
Ala Lys Arg Asp Arg 'Gly Leu Pro Leu Phe Asp Leu Asp Glu Val Val 
370 < 375 3fco 

AAC GCA GCA CTT CTG TTA GTG GAT GGG ATT TAT GGA GCC AAA GAT GGA 1680 
Asn Ala Ala Leu Leu Leu Val Asp Gly He Tyr ,Gly Ala Lys Asp Glv 
385 390 395 400 

GGA GCT TCC CGG CTT CCA GCT GGA CAA GCT ACA TAC CGG ACC ACC TGC 1728 
Gly Ala Ser Arg Leu Pro Ala Gly Gin Ala Thr Tyr Arg Thr Thr Cys 

405 410 415 

CAG GAC TTC AGG ATC CTT GAC CGG TAC CAG ACT GCC TTA CCA GCC AGG 1776 
Gin Asp Phe Arg He Leu Asp Arg Tyr Gin Thr Ala Leu Pro Ala Arg 

420 425 430 

AAA GGA TTC CGG CAC CAG ACC ACC AGA TTT TTG TAT CGT CTG GTG GGA 1624 

Lys Gly Phe Arg His Gin Thr Thr Arg Phe Leu Tyr Arg Leu Val Gly 
435 440 445 

TCA GAA GAT CTG GCT GTG GAC CAA AGT ATT GTC AGC CCT TAC ACT TCT 1872 
Ser Glu Asp Leu Ala Val Asp Gin Ser He Val Ser Pro Tyr Thr Ser 
450 455 460 

CGG ATC TTG AAA CCT TAT ATC AGG CGT GAT TAT GAG ACA AAG CCA CCC 1920 
Arg He Leu Lys Pro Tyr He Arg Arg Asp Tyr Glu Thr Lys Pro Pro 
465 470 475 480 

AAA CTA CAG CTG CTG TCC CAG ATT CGC TCC CAC CTG CAC AAG AGT GAC 1968 
Lys Leu Gin Leu Leu Ser Gin He Arg Ser His Leu His Lys Ser Asp 

485 490 495 

CCT CAC TGG ACA CCT GAA CCT GAT GCA CCT CTC GAT TAC TGC TAT GTC 2016 
Pro His Trp Thr Pro Glu Pro Asp Ala Pro Leu Asp Tyr Cys Tyr Val 

500 505 510 

CGA CCA AAT CAC ATC CCA ACG ATC AAC TCC ATG TGT CAG GAA TTT TTC 2064 
Arg Pro Asn His He Pro Thr He Asn Ser Met Cys Gin Glu Phe Phe 
515 520 525 
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TGG CCT GGC ATT GAC CTG TCT GAG TGT CTG CAG TAT CCA GAC TTC AGT 2112 
Trp Pro Gly lie Asp Leu Ser Glu Cys Leu Gin Tyr Pro Asp Phe Ser 
530 535 540 

GTC GTA GTC CTT TAT AAA AAA GTC ATT GTT GCC TTT GGC TTC ATG GTT 2160 
Val Val Val Leu Tyr Lys Lys Val He Val Ala Phe Gly Phe Met Val 
545 550 555 560 

CCC GAT GTG AAG TAC AAC GAA GCT TAC ATT TCA TTT CTG CTT GTT CAT 2208 
Pro Asp Val Lys Tyr Asn Glu Ala Tyr He Ser Phe Leu Leu Val His 

565 570 575 

CCC GAG TGG AGG AGA GCA GGG ATT GCC ACG TTC ATG ATC TAT CAT CTG 2256 
Pro Glu Trp Arg Arg Ala Gly He Ala Thr Phe Met He Tyr His Leu 

580 585 590 

ATT CAG ACG TGC ATG GGC AAG GAT GTG ACT CTT CAC GTC TCG GCG AGC 2304 
He Gin Thr Cys Met Gly Lys Asp Val Thr Leu His Val Ser Ala Ser 
595 600 605 

AAC CCT GCC ATG CTG CTG TAC CAG AAG TTT GGC TTC AAG ACA GAG GAG 2352 
Asn Pro Ala Met Leu Leu Tyr Gin Lys Phe Gly Phe Lys Thr Glu Glu 

610 615 620 

i 

TAT GTT TTG GAT TTT* TAT GAT AAG TAT TAC CCA CTG GAG AGT ACA GAG 2400 
Tyr Val Leu Asp Phe Tyr Asp Lys Tyr Tyr Pro Leu Glu Ser Thr Glu 
625 ' 630 635 640 

TGT AAA CAT GCA TTC TTC CTG AGG CTC CGA CGC TGAGGTGAAC TCAGTTCTTC 2453 
Cys Lys His Ala Phe Phe Leu Arg Leu Arg Arg 

645 650 



CCAGCAAACA 


GCAGATGCTG 


TCAGAGAGTG 


GCCTCAGAAG 


TCGTGCAGGC 


CCTTCTCCTC 


2513 


GAAGTGGGCC 


CTGGCCTCCT 


GTGAGTAATG 


GCCTCCACAG 


GGTCCAGCTG 


GGTGAGTGAA 


2573 


GCCATGTGCA 


GAGGTACAAG 


TGGCTGTTCA 


ATCCAGAAAC 


CATCCAGCAG 


GCCCTGCGTG 


2633 


GCCTTCCAGC 


AGTATCTTGG 


ACTCCACTGT 


GGACCAGAAC 


CAAGTGACTC 


TGCCATTCTT 


2 693 


TTTCCACACA 


ATGAGTGGGC 


ACAAAAGAGC 


TTCTGAGCTG 


AAGCATTTGA 


GAATGGGCCT 


2753 


GCTCCTCCTA 


TGTGAGAGCT 


CAGGAAGTAT 


GAAAGATGTT 


AAAGACTGCC 


AGGGCCCTCA 


2813 


CTGTCCTGGG 


CCGAGAGGTT 


ATGAGGTGTC 


AGGAATGACT 


CCAACCCTGT 


CTCAGCCTGT 


2873 


GCTGTGCTGT 


CATCGAACCC 


CAGAGGATTC 


ATGGGGCACT 


GAAGTCCTTT 


TGTCTCTTAC 


2933 


TGACTTTCGT 


CCTGGTCAGG 


AAAAAGATAC 


TCGGTGTCTG 


TTTCCAGACT 


GCTGCTGCTT 


2993 


TACTTATCTA 


CCTGTCAGCA 


CCCCTTGCAG 


GGACATTGAC 


CTGGTACAAG 


AAGTTTACTC 


3053 


TTGGCATGGC 


CTGGAAAGTA 


GCCCGAGAGG 


AACGGTTGAG 


CTTGATACAT 


TAGATGCCTT 


3113 


CCCAGGCCCT 


TCTCATGCTC 


AGGACAGTTG 


CACCACAGAA 


GTGGAGGTGT 


GGAATGGCGC 


3173 


AGGCAGGCCT 


GGGCGCCCTG 


TCCACCTCAT 


GAAGCTGCTT 


CCTGGGTTGC 


ACATGTTGGA 


3233 


GTTGGACTAC 


AGGTATCTAA 


GTGCATTCCA 


AAAGCATCAG 


ATATCTCCAA 


TAAAAATCCT 


3293 


TACCATACAA 


AAA 










3306 
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(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 651 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Met Leu Ala Met Tyr Asn Leu Ser Leu Glu Gly Ser Gly Arg Gin Gly 
15 10 15 

Tyr Phe Arg Trp Lys Glu Asp lie Cys Ala Phe' lie Glu Lys His Trp 

20 25 30 

Thr Phe Leu Leu Gly Asn Arg Lys Lys Thr Ser Thr Trp Trp Ser Thr 
35 40 45 

Val Ala Gly Cys Leu Ser Val Gly Ser Pro Val Tyr Phe Arg Ser Gly 
50 55 r 60 

Ala Gin Glu Phe Gly Glu Pro Gly Trp Trp Lys Leu Val His Asn Arg 
65 70 75 , 80 

Pro Pro Thr Met Arg Pro Glu Gly Glu Lys Leu Ser Ala Ser Thr Leu 

85 90 ' 95 

Lys Val Lys Ala Ser Lys Pro Thr Leu Asp Pro He He Thr Val Glu 

100 105 110 

Gly Leu Arg Lys Arg Ala Ser Arg Asn Pro Val Glu Ser Ala Met Glu 
115 120 125 

Leu Lys Glu Lys Arg Ser Arg Thr Gin Glu Ala Lys Asp He Arg Arg 
130 135 140 

Ala, Gin Lys Glu Ala Ala Gly Leu Leu Asp Arg Ser Thr Ser Ser Thr 
14b 150 155 160 

Pro Val Lys Phe He Ser Arg Gly Arg Arg Pro Asp Met He Leu Glu 

165 170 175 

Lys Gly Glu Val He Asp Phe Ser Ser Leu Ser Ser Ser Asp Arg Thr 

180 185 190 

Pro Leu Thr Ser Pro Ser Pro Ser Pro Ser Leu Asp Phe Ser Ala Pro 
195 200 205 

Gly Thr Pro Ala Ser His Ser Ala Thr Pro Ser Leu Leu Ser Glu Ala 
210 215 220 

Asp Leu He Pro Asp Val Met Pro Pro Gin Ala Leu Phe His Asp Asp 
225 230 235 240 

Asp Glu Leu Glu Gly Asp Gly Val He Asp Pro Gly Met Glu Tyr He 

245 250 255 

Pro Pro Pro Ala Gly Ser Ala Ser Gly Leu Met Gly Ser Arg Lys Lys 

260 265 270 
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Val Arg Ala Ala Glu Gin lie Lys Gin Glu Val Asp Ser Glu Glu Glu 
275 , 280 ' 285 

Lys Pro Asp Arg Met Asp Gly Asp Ser Glu Asp Thr Asp Ser Asn Thr 
290 295 300 

Ser Leu His Thr Arg Ala Arg Glu Lys Arg Lys Pro Pro Leu Glu Lys 
305 310 315 320 

Asp Thr Lys Pro Lys Gly Pro Arg Tyr Thr Pro Val Ser lie Tyr Glu 

325 330 335 

Glu Lys Leu Leu Leu Lys Arg Leu Glu Ala Cys Pro Gly Ala Val Ala 

340 345 350 

Met Thr Pro Glu Ala Arg Arg Leu Lys Arg Lys Leu lie Val Arg Gin 
355 360 ,365 

Ala Lys Arg Asp Arg Gly Leu Pro Leu Phe Asp Leu Asp Glu Val Val. 
370 375 380 

Asn Ala Ala Leu Leu Leu Val Asp Gly lie Tyr i Gly Ala Lys Asp Gly 
385 390 395 400 

Gly Ala Ser Arg Leu Pro Ala Gly Gin Ala Thr Tyr Arg Thr Thr Cys 

405 410 t 415 

Gin Asp Phe Arg He Leu Asp Arg Tyr Gin Thr, Ala Leu Pro Ala Arg 

420 425 , 430 

Lys Gly Phe Arg His Gin Thr Thr Arg Phe Leu Tyr Arg Leu Val Gly 
435 440 445 

Ser Glu Asp Leu Ala Val Asp Gin Ser He Val Ser Pro Tyr Thr Ser 
450 455 460 

Arg He Leu Lys Pro Tyr He Arg Arg Asp Tyr Glu Thr Lys Pro Pro 
465 470 475 480 

Lys Leu Gin Leu Leu Ser Gin lie Arc Ser His Leu his. Lys Ser Asp 

485 490 495 

Pro His Trp Thr Pro Glu Pro Asp Ala Pro Leu Asp Tyr Cys Tyr Val 

500 505 510 

Arg Pro Asn His He Pro Thr He Asn Ser Met Cys Gin Glu Phe Phe 
515 520 525 

Trp Pro Gly He Asp Leu Ser Glu Cys Leu Gin Tyr Pro Asp Phe Ser 
530 535 540 

Val Val Val Leu Tyr Lys Lys Val He Val Ala Phe Gly Phe Met Val 
545 550 555 560 

Pro Asp Val Lys Tyr Asn Glu Ala Tyr He Ser Phe Leu Leu Val His 

565 570 575 

Pro Glu Trp Arg Arg Ala Gly He Ala Thr Phe Met He Tyr His Leu 

580 585 590 



He Gin Thr Cys Met Gly Lys Asp Val Thr Leu His Val Ser Ala Ser 
595 600 605 
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Asn Pro Ala Met Leu Leu Tyr Gin Lya Phe Gly Phe Lys Thr Glu Glu 
610 615 , 620 

Tyr Val Leu Asp Phe Tyr Asp Lys Tyr Tyr Pro Leu Glu Ser Thr Glu 
625 630 1 635 640 

Cys Lys His Ala Phe Phe Leu Arg Leu Arg Arg 

645 650 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: , 

(A) LENGTH: 4234 base pairs 

(B) TYPE: nucleic acid > 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE : HW041 

(ix) FEATURE: ' 

(A) NAME /KEY: misc_f eature 

(B) LOCATION: 3 407.. 3 482 

(D) OTHER INFORMATION: /label* SAC„24317 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

GTCGACCCTC CTTCTGCCCT GGCTCCTGTT GATGAGCACA GTGCCGTCCC TGATCAGTTC 60 

i 

TTTCTTCCTG TGAATTTGCT TCGTAGCCCA TCAGGCTGGT AGACTGGCAG CACGGTCAGT 120 

GTCACACGAG GGACAGGACT TCCCTGAAGT GAAGCAGCAA TGTCATGGAA GCAAACCCCT 180 

GTGAGCGAGA GACAGGCCCC GAGGCTGCTG CCACAAGGTC TCATGTTCAT TGCTGGGGAC 24 0 

ACTAGCTTCC TACAAACCCC AGGAGACGCT GCCCATAGCA CGGGTGGCCC TTCAGGCTCC 300 

TGAACTATCT ATCCCCAGAT CAGTAGGATT TGTTTTCCAA GTCCCCTTCC TCATTCCTTC 360 

TTTGCTCAAG AGAAAAAAGA AAAGTGGTAA CTCTGTGACT CTAAGTCCCT CTTAAATGCA 420 

GCGCTGAATC TAGCTTGGCT CGGAGATCTC TGACTTGTCC CTAGGGTGAG ACTGTGGCCT 480 

CCCCAGGTTG AGGTGGCACG TGTATTGTGA GAGATTAGCC TGGAGAGCAG AGAAGGGGGG 540 

AGGGGGGATA GGGAGGAGAG AGCAATCGTG TCTCCCTACA GGACCCTGGG CCATCTGACA 600 

AGACAGAGGA GGTGGGACTG GCTTCCCCAT GCTGCACATG GCAGTGGGAA CCAGGCTTGG 660 

AGTCCCCCAC CCCCTCCTTA CCTCTTCTCT CAGACCTTGT AGCCCTAGCT TGCCCTGTGC 720 

TCTGCAGGGA AGGGGATGAG AGGACAGCAT AAGAAGTCTT GCCAGTTGAG GTGGGGCTGG 780 

AAGATGAAGC AGGTGGGTGT AGGAATCATT TTGGATAGGG CATGTTGTTT GGTTGACGGC 840 

AACTAATTCT CTTAAGAGCA TATTGTCTCC AGTTAATGTT GGAGACAACC CAGAAAGGGA 900 
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AGGGTATAGA GATGGGTAAT 


TAATGAGATA 


TTCCAGCCCT 


GGAGCAATGT CTGAGCCACC 


960 


CCAAACTTAfci 




AGCATAGGAG 


t 

TGGGGCACTG 


CCCCTTACTC AAATTCTAGT 


1020 


CTAAGTbTIv 


CAuAL.Av*Al 1 


GTACACTTGG 


i 

ACAGGAAATC 


TCCTGTTCCA GCATACCCTC 


1080 


TCCCCCAlAt- 




ATACCCCCTC 


CCCCATACCT 


TCTCCCCCAT ACTGTTAGTA 


1140 


CGTTTCACj 1 1 


ull UAAAAvC 


AAAGGGGGCG 


CAGACTAGGC 


TTTAACTGCA TGTGGAGTGT 


1200 


AGATTTTGCC 


TAAGAATCCA 


TGTTGAAGAA GAGAGAGGCT ATGTCTTTAA GGTCGAGGCC 


1260 


CTTCAGTCTT 


CTGTCCTTGT 


AATTCTCTTG 


AAGCGTGAGC 

t 


CCCAGGCACT CAGAAGCCAT 


1320 


GTTGTACTCC 


GGGAAGTTCA 


TGGGACCTGA 


ACTCCTTCAA 


CACCGGGAAC AACAGTACTG 


1380 


cataggtact 


TTGGTACTGA 

i i 


CTGTGCAAGC 


ACTAGAAAAA 


GTACCCGTTG AATGGCCCTC 


1440 


acagcatcag ggaaacgagg ccgagttcac tctcctgatt 1 gctcagatgc agaagctgga 


1500 


cccgtgaggt 


GTACCAGCCC 


i 

ACCTCCAAGC 


TTGACACAGA 

1 < < 


GGCTAGCACT TAAAGCAAAG 


1560 


actcagaaga 


GGAGTCTCAG 


TCACAGGCTG 


CACAGCCTTG 


AACATGATAT CTAGAACTAT 


1620 


gcctgggtga 


TCGGTGGCCC 


AACTTGTGAC 


ACTTGTGTGC 

i 


ACATAGGGAA GCTGACGCTC 


1680 


ctctcggggc 


CTGAAAACAG 


GACTAAACCC 


4 % tot «m a*** to MP1 v 

AAACCCCATA 


! 

TAACTAACCA TACCACTGCC 


1740 

i 


ATCCACAGGC 


GTGGATGGAA 


GTCTTCACGG 


TCTGTTACAA 


« .M* * M ^WMA to to to to AAAAS MA) to 

GCACAAGTAC AAAGCCGATA 


1800 


AAAATAAGTC 


AAAGCCCTTC 


CTCGTGGTGC 


AGAGGGCTCC 


ma ma ^» to ^Nt to mh jai ntMpnnAM to *to^^ 

TCCTCAGATC TCTTGGAGCC 


1860 


tttgatcata 


GAGCTTGACG 


GTCTCCTCTC 


•*A).*Al>A to FT* to to to ^N^t 

GGGATAAAGG 


i 

to jAt to AAfHl to to /«MMA)inMjMMjMjlf1 

ACAGGTAAAG GCTGTGGCCT 


1920 


tcagcggtga 


CTGCCCCACA 


CCCTGCTTTC 


CTTAGGGGTG 


S+* to /A] to jAi to to to, to 

GAGACAAGGA ATGGTGGGGC 


1980 


cagtgttgtt 


CAGAGAGAAG 


TCTCTCTTTG 


GAGCCCATGT 


ATGGGGCTTG ACCTTTTGCT 


2040 


gtgagtgcat 


TTAGAGGCAA 


GTGAATTCTG 


GCCTCAGTAC 


AGAGAACCCA GCGGTTTCAG 


2100 


cactcaagaa 


AGAAGCCGCA 


GTGTCCCCTA 


CAGAGAGGCT 


CTGTCCTGCT TGGCCCCAGC 


2160 


cacctgcgcc 


ACACATTTAA 


TAAAGTGGCT 


GAAACTCCTC 


CTTTCCAAGA GCTAGTGTGC 


2220 


tgggtcctgt 


CCTCCCCACA 


GCCTTCTGAG 


TCCCAGAGAC 


TTGTGTGAGC AGTCGGATAA 


2280 


GGCCAATCTG 


GGAGACAGGG 


GATGACCATG 


ACACAAGGTG 


TTTCCAGGCA GGTCCCTTGT 


2340 


GCGTAGGCTG 


CAGAGAGCTG 


TCCTGACTGG 


GAGCTTGGGT 


TGTACGAACA CATGGTGCCT 


2400 


TCTTTGTCTG 


TGCTGAACGA 


GTAGGGTGTG 


TTTCTCTGTG 


TTCTCTGAGT AAGATCACAC 


2460 


CCTCTTTAAG 


CAAAGGGGTG 


TTGTGGATAT 


TAGAAGCAAG 


ATTGATAAAT GTTCTTAAAT 


2520 


GGTGGTGCTA 


GAGAAAAGTC 


AGTCCCCGTA 


ACTTCCTAAA 


AGCTGAGTCA GCCAGTCTCT 


2580 


AAGTATGTAC 


ATGTTGGCTT 


TCTTCTATTT 


TTTTTCTAGT 


TAATCAGTGT GTTTTAAATG 


2640 


TTTTTAGTTG 


ATGTGAGGAT 


TTTTTTAAAG 


GCATTTGTGT 


GTGGTGTGGC TCCATGGGAG 


2700 


ATAGGCTGTG 


TAATAAGAAC 


CCTCAGAAGC 


CAGCCAGTCA 


TCCAAGGGGC GCCCAAGTTC 


2760 


AGTGGATTTC 


TTCAGGAATT 


GGAGAGCAGT 


ATTTTTCCCT 


CTTTTCAGTA GTATGCCTGT 


2820 
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TCTCATTTAG 


TAATTTAATG 


GTATATTGCA 


TTATATGTGA 


ATGCATACAT 


AAACGGTGGA 


28B0 


ATTTCTCTTA 


ATATCCAAAA 


AAGTCCTTCT GTTGCCCTCT 


GAGCTCTCTC 


CTTGGGGAGG 


2940 


ACTGCAGGGC 


TCACTGTGTA 


AAATGGTTGT 


GGACTCCTCT 


AGCAGAATGG 


CTTCCAAGGC 


3000 


CTGGCCCATT 


CCTCAGATTC 


CCTTTGAAGT 


GAGAGGTCAT 


TGTCTTTTGA 


GAATGGATTT 


3060 


AAGTTTTGGA 


AATACCCCAC 


GCTGTTCACA 


GTGCAAGCGA 


CTGAACACAG 


AACAATTAGT 


3120 


CAGAATCAGA 


ATCCCAGGGC 


CATCAAGATG 


ACTCCACGGA 


AAAGGGACTG 


CCACCAACCA 


3180 


GAGCCAAGAC 


CTGAGTTTGG 


TCCCCAGGAA 


CCCCATGGCA 


GCAGGAGAGA 


GTCAGCCGAT 


3240 


GCAAGTGTCA 


CACACACACA 


TAAACACACA 


CATACACATG 


'CAACACATAC 


ACACAAACAC 


3300 


ATACATACAC 


ACACACAAAC 


ACATACACAC ATGCACACAC ACACACACAC 


ACACACACAC 


3360 


AGAGTTTTTG 


GAAGAATATC 


TATGATGTAG 


GAAAAGGTTT 


GTTAGTACCT 


GCAACCTCTA 


3420 


GGGCCTCATT 


TCCAAGCCTG 


CTGCTTCTCT 


TGCCAGGACT 


TfATACGTGTG 


CAACCCCTGC 


3480 


GTACACTACA 


GACCTAAGTA 


CATACTCGCT AACATGAACA CCCAGTAGGT 


ATTCTGTAAA 


3540 


ACGGATATAC 


TTAAGTGCAT 


GTTATTGTCG TGTGTGCCGC GACTGAGCTG 


TGTCTTTGTC 


3600 


ATGAAACTTT 


GCTTCTTTGC 


CAAATTAAGA 


CTCTGCCCTC 


CCTCCGAGGA 


AGGAAATATA 


3660 


CAGCAGTTTC 


TGGAATCAGC 


TTCTCAAGTT 


TTCCCGCCTG 


TGTTGTGTTT 


ACTTGTGGCT 


3720 


TTGCCTGAGA 


GTCTGTCCTC 


ACGTACACAC 


CTGTGAGTAT 


TTATTACAAG 


GCAGAACGGG 


3780 


GACAGGCGTA 


GCCATAGTAC 


TTGTGTTTGA 


ATGTTTCCAG 


GGAGGGACAA 


TGGAGCCCCC 


3840 


TCCTCAGTGA 


GGTGCTCACA 


GAATCTGTGT 


CAAATTCTGT 


GAACGGGGTC 


CAACACTAGT 


3900 


ACCAGCTTCA 


CCCCAACCCT 


GAGTGCCCTG 


GAGCCACTGC 


CCTCTCTACT 


GCCTTGTGTG 


3960 


TGCATGGGTG 


TGGTTGTGTG 


TGTCTCTGTT 


GGGGCATGTG 


CCGCATTTCG 


CAGTGGACTT 


4020 


TGGTATCCAC 


ATGTAACCTT 


TGGCATCTGT 


GTTCCTGGAC 


CGCATTGACA 




vvuv 


GTGCAGATCT 


GTGTAGATGG 


GTCTGTGTGC 


AAAGTGGGGA 


GTCCCCTGAA 


AGTGACTATG 


4140 


GATATTGTTG 


GTCAGCCAAA 


GACTTTCCTA 


TTCTTTGCTG 


CTTAAACTTT 


GTGCCTTAAT 


4200 


ATTGTAAATA 


TAAACGGTTA 


AAACAAAAGT 


CGAC 






4234 


(2) INFORMATION FOR SEQ ID NO; 29 


• 
• 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 1792 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE; 
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(B) CLONE: HW042 

r 
I 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 8.. 889 

(ix) FEATURE: 

(A) NAME/KEY: misc_£eature 

(B) LOCATION: complement (1409.*1618) 
(D) OTHER INFORMATION: /label* SAC_24017 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

GTCGACC AAG GAT GGT GGA AGT TTC AGT GGT ACT GGT GGC CTT CCA GAT 49 
Lys Asp Gly Gly Ser Phe Ser Gly Thr Gly Gly Leu Pro Asp 
1 5 10 

TAT TCT GCC CCC AAT CCC ATT AAA GTG ACC CAT CGG TGC TAC ATC CTT 97 
Tyr Ser Ala Pro Asn Pro lie Lys Val Thr His Arg Cys Tyr lie Leu 
15 i 20 25 30 



GAG AAT GAC ACA GTC CAG TGT GAC TTG GAC CTG TAC AAG TCC CTG CAG 145 
Glu Asn Asp Thr Val Gin Cys Asp Leu Asp Leu Tyr Lys Ser Leu Gin 

35, ' 40 45 

GCT TGG AAA GAC CftC AAA TTG CAC ATC GAC CAT GAG ATT GAA ACC CTG 193 
Ala Trp Lys Asp His Lys Leu His lie Asp His Glu lie Glu Thr Leu 

50 55 60 

CAG AAC AAA ATT AAG AAC CTT CGA GAA GTC AGG GGT CAC CTG AAG AAG 241 
Gin Asn Lys lie Lys Asn Leu Arg Glu Val Arg Gly. His Leu Lys Lys 
65 ,70 75 

AAG CGG CCA GAA GAA TGT GAC TGT CAC AAA GTC AGT TAC CAC AGC CAA 289 
Lys Arg Pro Glu Glu Cys Asp Cys His Lys Val Ser Tyr His Ser Gin 
80 85 90 

CAC AAA GGC CGC CTC AAG CAC AAG GGC TCC AGC CTG CAC CCT TTC AGG 337 

His Lys Gly Arc Leu Lys His Lys Gly Ser Ser Leu His Pre Phe Arc 
95 100 105 110 

AAG GGT CTG CAG GAG AAA GAC AAG GTG TGG CTG TTG CGG GAG CAG AAA 385 
Lys Gly Leu Gin Glu Lys Asp Lys Val Trp Leu Leu Arg Glu Gin Lys 

115 120 125 

CGC AAG AAG AAA CTG CGC AAG CTG CTG AAG CGA CTG CAG AAC AAT GAC 433 
Arg Lys Lys Lys Leu Arg Lys Leu Leu Lys Arg Leu Gin Asn Asn Asp 

130 135 140 

ACG TGC AGC ATG CCG GGC CTC ACG TGC TTT ACT CAC GAC AAC CAC CAC 481 
Thr Cys Ser Met Pro Gly Leu Thr Cys Phe Thr His Asp Asn His His 
145 150 155 

TGG CAG ACC GCG CCG CTC TGG ACG TTG GGA CCC TTC TGC GCC TGC ACC 529 
Trp Gin Thr Ala Pro Leu Trp Thr Leu Gly Pro Phe Cys Ala Cys Thr 
160 165 170 

AGC GCC AAC AAT AAC ACG TAC TGG TGC CTG CGG ACC ATC AAC GAG ACC 577 
Ser Ala Asn Asn Asn Thr Tyr Trp Cys Leu Arg Thr lie Asn Glu Thr 
175 180 185 190 

CAC AAC TTC CTC TTC TGT GAA TTC GCA ACT GGC TTC ATA GAA TAC TTT 625 
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His Asn Phe Leu Phe Cys Glu Phe Ala Thr Gly Phe lie Glu Tyr Phe 

195 200 205 

GAC CTC AGT ACG GAC CCC TAC CAG CTG ATG AAC GCG GTC AAC ACA CTG 673 
Asp L u Ser Thr Asp Pro Tyr Gin Leu Met Asn Ala Val Asn Thr Leu 

210 215 220 

GAC AGG GAC GTT CTC AAC CAG CTG CAC GTC CAG CTC ATG GAG CTA AGG 721 
Asp Arg Asp Val Leu Asn Gin Leu His Val Gin Leu Met Glu Leu Arg 
225 230 235 

AGC TGC AAA GGC TAC AAG CAG TGT AAC CCC CGG ACC CGC AAT ATG GAC 769 
Ser Cys Lys Gly Tyr Lys Gin Cys Asn Pro Arg Thr Arg Asn Met Asp 
240 245 250 

CTG GGG CTT AGA GAC GGA GGA AGC TAT GAG CAA TAC AGG CAG TTT CAG 817 
Leu Gly Leu Arg Asp Gly Gly Ser Tyr Glu Gin Tyr Arg Gin Phe Gin 
255 260 265 270 

CGT CGA AAA TGG CCA GAA ATG AAG AGA CCC TCT TCC AAA TCA CTG GGA 865 
Arg Arg Lys Trp Pro Glu Met Lys Arg Pro Ser Ser Lys Ser Leu Gly 

275 280 ( 285 

CAG CTA ,TGG GAA GGT TGG GAA GGC TAATCGGCCA TGGAGAGGAA CCTCCAAAAC 919 
Gin Leu Trp Glu Gly Trp Glu Gly 

290 



CAGAGGCCTC 


CTGTGGCTGC 


CCAGGCGTGC 


AAAAATCACC 


CAATTCCGAG 


GAGATCGATG 


979 


CTGGAACTGG 


GAGACTTGAC 


AGGAGGCAGG 


GTCTGCACTT 


GGGACAGGAA 


ATCCCGAGGA 


1039 


GAACGCCAAG 


ACTTCCAGAG 


GCTCATTGTT 


TTGCCCCGCT 


TTGCTTCGGA 


TTGAACCTCA 


1099 


CCGGCCGCAC 


AAGGATGCGT 


CCTGACACCT 


GGAGTCTCCG 


CTCACCCTTT 


CAGAGGCTCA 


1159 


CCAAGACAAA 


GGAACTAATT 


CCCATGGACT 


TTCCTCCAGA 


GATGGAAATT 


GCTGGGATTT 


1219 


ACCCATCCCC 


TCCCCTGTAC 


CCCCTTCCCC 


ACTAGGGAAA 


CAAGCTTGTT 


TTAACCTTCT 


1279 


TATTCTTTGG 


AGAAAGCACG 


GATGTCCCAG 


GTGCTGTCAA 


CAGCGCAGTC 


TTGACAATGG 


1339 


TCTATAGCAC 


AAAACAGTAC 


CATTCACCTG 


GCTGGATGAG 


CCGGCTGCCC 


CAGAAGCTGC 


1399 


CTTCACTGTA 


CATATGTGAC 


TGCTCACATG 


TAACCAACAC 


AGGGACTTGT 


AGGGGGAATC 


1459 


TCACTAATAC 


GAAATCCCAT 


TTTCAAGAGT 


CGCGGTGTCA 


ATAAACACTC 


TGTGGCTGGT 


1519 


GTCAAGGGTA 


ATCCCTTGAG 


CTTTCAGACA 


TTTCTGTTCC 


TGCCCAGGAT 


TCGTTCCTTT 


1579 


GTTATCCATA 


TCCATCCCAG 


AACTGATGTT 


TTCTAAGGTA CTGAAACCCC AAGTTGATGT 


1639 


GTGTCCTGTG 


TTTTAATGAC 


ATTGTATTTG 


TAAAGCAATT 


TTGTAGTATA 


AAAGTACCAT 


1699 


CTTCCAGTGT 


TCACATCCCC 


CCCAGCCAAT 


GACTAGCTCT 


TGG T ATG AAA AAAAAAATCT 


1759 


TTGAATTTTT 


GTAAAAAAAA 


AAAAAAAGTC 


GAC 






1792 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 294 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30; 

Lys Asp Gly Gly Ser Phe Ser Gly Thr Gly Gly L u Pro Asp Tyr Ser 
1 5 10 15 

Ala Pro Asn Pro lie Lys Val Thr His Arg Cys Tyr He Leu Glu Asn 

20 25 30 

Asp Thr Val Gin Cys Asp Leu Asp Leu Tyr Lys Ser Leu Gin Ala Trp 
35 40 45 

Lys Asp His Lys Leu His He Asp His Glu He' Glu Thr Leu Gin Asn 
50 55 6,0 

Lys He Lys Asn Leu Arg Glu Val Arg Gly His Leu Lys Lys Lys Arg 
65 70 75 80 

Pro Glu Glu Cys Asp Cys His Lys Val Ser Tyr His Ser Gin His Lys 

85 90 95 

Gly Arg Leu Lys His Lys Gly Ser Ser Leu His Pro Phe Arg Lys Gly 

100 105 i 110 

Leu Gin Glu Lys Asp Lys Val Trp Leu Leu Arg Glu Gin Lys Arg Lys 
115 120 , 125 

Lys Lys Leu Arg Lys Leu Leu Lys Arg Leu Gin Asn Asn Asp Thr Cys 
130 135 140 

Ser Met Pro Gly Leu Thr Cys Phe Thr His Asp Asn His His Trp Gin 
145 150 155 160 

Thr Ala Pro Leu Trp Thr Leu Gly Pro Phe Cys Ala Cys Thr Ser Ala 

165 170 175 

Asn Asn Asn Thr Tyr Trp Cys Leu Arc Thr lie Asn Glu Thr Hie Asn 

180 185 190 

Phe Leu Phe Cys Glu Phe Ala Thr Gly Phe He Glu Tyr Phe Asp Leu 
195 200 205 

Ser Thr Asp Pro Tyr Gin Leu Met Asn Ala Val Asn Thr Leu Asp Arg 
210 215 220 

Asp Val Leu Asn Gin Leu His Val Gin Leu Met Glu Leu Arg Ser Cys 
225 230 235 240 

Lys Gly Tyr Lys Gin Cys Asn Pro Arg Thr Arg Asn Met Asp Leu Gly 

245 250 255 

Leu Arg Asp Gly Gly Ser Tyr Glu Gin Tyr Arg Gin Phe Gin Arg Arg 

260 265 270 

Lys Trp Pro Glu Met Lys Arg Pro Ser Ser Lys Ser Leu Gly Gin Leu 
275 280 285 

Trp Glu Gly Trp Glu Gly 
290 
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(2) INFORMATION FOR SEQ ID NO; 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2261 base pairs, 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW043 

(ix) FEATURE: 

(A) NAME/ KEY; CDS 
<B) LOCATION: 8.. 634 

(ix) FEATURE: 

(A) NAME /KEY: misc_£eature 

(B) LOCATION: 582.. 808 

(D) OTHER INFORMATION: /label* SAC_24533 

i 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 1 

GTCGACG GCT CCA CCA GGT TCA AGA CAC AAA CCT GTG CCT ACC AAG TCA 49 
Ala Pro Pro Gly Ser Arg His Lys Pro Val Pro Thr Lys Ser 
15 10 

ACA CCA GCT GGC ACA GAA CTA GGT AGC ACA GGA AGC GAC GGG GGA GAT 97 
Thr Pro Ala Gly Thr Glu Leu Gly Ser Thr Gly Ser Asp Gly Gly Asp 
15 20 25 30 

GCA GTA GAA TAC AAC CCC AAC CTC CTG GAT GAC CCA CAG TGG CCG TGT 145 
Ala Val Glu Tyr Asn Pro Asn Leu Leu Asp Asp Pro Gin Trp Pro Cys 

35 40 45 

GGG AAG CAC AAG CGT GTA CTC ATC TTT GCC TCA TAC ATG ACC ACT GTT 193 
Gly Lys His Lys Arg Val Leu He Phe Ala Ser Tyr Met Thr Thr Val 

50 55 60 

ATA GAG TAT GTG AAG CCT GCA GAC CTC AAA AAG GAC ATG AAC GAG ACC 241 
He Glu Tyr Val Lys Pro Ala Asp Leu Lys Lys Asp Met Asn Glu Thr 
65 70 75 

TTC AGG GAG AAG TTC CCT CAT ATC AAA CTG ACA TTG AGC AAA ATT AGG 289 
Phe Arg Glu Lys Phe Pro His He Lys Leu Thr Leu Ser Lys He Arg 
80 85 90 

AGT TTA AAA CGG GAG ATG CGG AAC CTT TCT GAA GAG TGC AGC TTG GAG 337 
Ser Leu Lys Arg Glu Met Arg Asn Leu Ser Glu Glu Cys Ser Leu Glu 
95 100 105 110 

CCT GTG ACT GTG TCC ATG GCC TAT GTG TAC TTT GAG AAG CTT GCA CTG 385 
Pro Val Thr Val Ser Met Ala Tyr Val Tyr Phe Glu Lys Leu Ala Leu 

115 120 125 

CAG GGC AAG CTC AAC AAA CAG AAC CGC AAA CTG TGT GCT GGC GCC TGT 433 
Gin Gly Lys Leu Asn Lys Gin Asn Arg Lys Leu Cys Ala Gly Ala Cys 

130 135 140 
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GTT CTG TTG GCT GCC AAG ATC AGC AGT GAC CTC CGC AAG AGT GAA GTG 481 
Val Leu Leu Ala Ala Lys lie Ser Ser Asp Leu Arg Lys Ser Glu Val 
145 150, 155 

AAG CAG CTT ATT GAC AAG CTG GAG GAA AGG TTC CGG TTC AAC AGG AAG 529 
Lys Gin Leu lie Asp Lys Leu Glu Glu Arg Phe Arg Phe Asn Arg Lys 
160 165 170 

GAT CTC ATT GGA TTT GAG TTC ACG GTG CTT GTG GCT TTG GAA CTG GCC 577 
A Asp Leu lie Gly Phe Glu Phe Thr Val Leu Val Ala Leu Glu Leu Ala 
1 175 180 185 190 

CTG TAG CTC CCG GAG AAC CAA GTA TTA CCT CAC TAC AGA CGC CTC ACC 625 
Leu Tyr Leu Pro Glu Asn Gin Val Leu Pro His Tyr Arg Arg Leu Thr 

195 200 205 

CAG CAG TTC TAGCCCAGAA CAGTGCACCC CACAGGGAGC CTGCTTTGGC 674 
Gin Gin Phe 

AGGAAGGCGA GTAGCTTGGC TAGAGCACTG ACTAGTA6TG TCCCTGTCCT TCCTCTAGTA 734 

ACTGCACTTG CCCCTCAGTC CAGGGCTGCA CTTAGCTTCC CTTGGAAAGA GGCACTGCTC 794 

TTATTTTGGA GTGTACCCCA TCTTCACCTT GAG<3CCCTGT TGAGGGTTTT CCTGAAGCAA 854 

■I ' 

ATGAACATCG TGGGGCTTGG TTTGCTTTTC ATGTGCCTGA GACCAGCCTG GTTTATGTTG 914 

TCAATCCTGA GGTCCAGCTT TCATTTTGCC TAGCCATGAA ACCATTCCTG TAGAGACCTA 974 

GGTGTCCAGA CCTTCTGAAC ATACTGGAAT TGCCAGGCTT CTGTCTCCTG GTGCCTCCAC 1034 

CCAGGCAGAG CTCCCC£ATC AGCACTTTTT TTTGGTCAAG ATGCAGGAAA GGGCAGTCTG 1094 

CTCACACTGG ACCTTGTTAA AGCATCTGCC ACTGTGAAAC CCCTGTGCCA CCACATACAG 1154 

TGTTTGGCCA ACACCACCAC CCACCCCAAC CCCTGCCTCC TGCCAGCACT TATCCCTGCG 1214 

GTACTGGACC ATTTAAGACA ACCGCTGACT GCGGGCTCTT GACTTTCCTG AGCAATATCA 127 4 

TGATGGAGAA AAGGTATTAA CTTGAGATTT GGTTTCTATA ACAGAAAAGA CGGCCCATTA 1334 

AGGCCTGCTG CTGACTCTGC ACCACACTTG ATCCTCAAGC AGGAAGGTAC TGACCCTGCC 1394 

ACCTAACAGC ACATCAGTGA CTCCTCACAA ATAGATGCCA TCCAGGCCAA CTTGCCTGTG 1454 

TATTGACTGT GCCACTGTCT TCCTACTGGC ACCAGACAAG AGCCATCCTC CACCTTCCCA 1514 

GGCCTGTTAT TCCCTACAAC TGGTCACCAG CCTGGCAGAT GCCAGTCCCA CTGCTCTCAA 1574 

GTGCTTGGTT CCACAGCCAC TCATCCCAGA AATGCTTACC CTCTCAAGCC TGACACCACT 1634 

CAACTACCTA ATGGATGTTA GTAGCTTAAA CAGAGACCTT GAAGGATAGT AGAAGGATAG 1694 

TAAACTCTAG CCTTCTACCC TGCCCATGCT TATGCTGGTG TAAGCACGCT ATCCTTAGAG 1754 

CACCTGGGTG CCATCTGCAG ACAGTGATGT CTTACTTAAA ATGAACTTTA GATACAGCTG 1814 

ACTGGTCGGC TATCCCTGGG GGTGATACCC TGGTAGCTTA TTCTAGTTAT GTTAAAGCTG 1874 

GCAACAGTGA TAATTTAAGC TTCTATATTT ATGATCTTAC CAAGCTGGTG TGCTGTTAGT 1934 
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CCTTCCCTCC TTGCCCTCAG TAATGCTAAG AGCTGCTGAC ATGTGGTATG ATAAGCACTG 1994 

TTACAACAAA ACCAACCTTT AAACCAAATG ACAGGCTAAA CACAGTGTCT GTCCTAGACT 2054 

AAGTCCAGAA ATCCGAGGTG TGCCCACGGT GTCCAGCTGG ACTGTGACCC AATGCCACAG 2114 

ACCTGGTGCT GTACCTTCAG CACTCAACAG GCACAGCTGT GTGTGTGACC TCAGAATGCT 2174 

GCTTATGTAT GTCCAGCCGG TTCTGAATGG AAGGTGTTAA ATGTTTTTGT TAAAAGTAAA 2234 

ACTTGTTTTT GCAAAAAAAA AGTCGAC 2261 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: ' 

(A) LENGTH: 209 amino acids 

(B) TYPE: amino acid ' 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Ala Pro Pro Gly Ser Arg His Lys Pro Val Pro Thr Lys Ser Thr Pro 
1 5 10 i 15 

Ala Gly Thr Glu Leu Gly Ser Thr Gly Ser Asp Gly Gly Asp Ala Val 

20 .25 1 30 

Glu Tyr Asn Pro Asn Leu Leu Asp Asp Pro Gin Trp Prp Cys Gly Lys 
35 40 45 

His Lys Arg Val Leu lie Phe Ala Ser Tyr Met Thr Thr Val He Glu 
50 55 60 

Tyr Val Lys Pro Ala Asp Leu Lys Lys Asp Met Asn Glu Thr Phe Arg 
65 70 75 80 

Glu Lys Phe Pro His He Lys Leu Thr Leu Ser Lys He Arc Ser Leu 

85 90 95 

Lys Arg Glu Met Arg Asn Leu Ser Glu Glu Cys Ser Leu Glu Pro Val 

100 105 110 

Thr Val Ser Met Ala Tyr Val Tyr Phe Glu Lys Leu Ala Leu Gin Gly 
115 120 125 

Lys Leu Asn Lys Gin Asn Arg Lys Leu Cys Ala Gly Ala Cys Val Leu 
130 135 140 

Leu Ala Ala Lys He Ser Ser Asp Leu Arg Lys Ser Glu Val Lys Gin 
145 150 155 160 

Leu He Asp Lys Leu Glu Glu Arg Phe Arg Phe Asn Arg Lys Asp Leu 

165 170 175 

He Gly Phe Glu Phe Thr Val Leu Val Ala Leu Glu Leu Ala Leu Tyr 

180 185 190 

Leu Pro Glu Asn Gin Val Leu Pro His Tyr Arg Arg Leu Thr Glri Gin 
195 200 205 
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(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1538 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW044 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 24.. 1100 i 

(ix)' FEATURE: 

(A) NAME /KEY: misc.feature 

(B) LOCATION: complement (1161.»1378) 
(D) OTHER INFORMATION: /labels SAC_24216 

i 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

GTCGACCTCC ATAGAGTTCC GGC ATG AAG CCA CCG CAG AGA CGG CGG AAG 

Met Lys Pro Pro Gin Arg Arg Arg Lys 
1 5 

GTC CCC GCA CGC TAC ACA GGT GAA GCT ACC GGC CCC ACA GCT TGG AGC 
Val Pro Ala Arg Tyr Thr Gly Glu Ala Thr Gly Pro Thr Ala Trp Ser 
10 15 20 25 

CCC CGC GAG ATG CGG CAC CTA CTG AGA CTA CTC CAG GCT CGG CGC GGC 
Pro Arg Glu Met Arg His Leu Leu Arg Leu Leu Gin Ala Arg Arg Gly 

30 35 40 

CAG CCG GAG CCA GAC GCC AAG GAG CTG GCC AAG GAG CTG CGA GGC CGG 
Gin Pro Glu Pro Asp Ala Lys Glu Leu Ala Lys Glu Leu Arg Gly Arg 

45 50 55 

AGC GAG GCC GAG ATC TGT CGC TTT ATC CAG CAG CTC AAA GGC CGG GTG 
Ser Glu Ala Glu lie Cys Arg Phe lie Gin Gin Leu Lys Gly Arg Val 
60 65 70 

GTT CGA GAG GCT ATT CGG AAG GTG CAA CCA GGT GGC AGA GAT GGT CCA 
Val Arg Glu Ala lie Arg Lys Val Gin Pro Gly Gly Arg Asp Gly Pro 
75 80 85 

AGG CAT CGA GAG ACA CAG CTC CCA GCC CCC ATA GAG GTA TGG ATG GAT 
Arg His Arg Glu Thr Gin Leu Pro Ala Pro lie Glu Val Trp Met Asp 
90 95 100 105 

CTT GCT GAG AAA TTA ACA GGC CCA CTG GAG GAA GCC CTG ACT GCA GCT 
Leu Ala Glu Lys Leu Thr Gly Pro Leu Glu Glu Ala Leu Thr Ala Ala 

110 115 120 
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TTC TCC CAG GTA CTC ACC ATT GCT GCT GCG GAA CCT CTC AGC CTC CTG 434 
Phe Ser Gin Val Leu Thr lie Ala Ala Ala Glu Pro Leu Ser Leu Leu 

125 130 135 

CAT TCC AAG CCA GGC AAG CCC ACA AAG, GCC TGT, GGA AGG GCA CTG GTG 482 
His Ser Lys Pro Gly Lys Pro Thr Lys Ala Cys Gly Arg Ala Leu Val 
140 145 150 

TTC TTG AGC ACC CAA GAT GCG CAG AAG GAT TCT GCC CCT GAA GGT TCT 530 
Phe Leu Ser Thr Gin Asp Ala Gin Lys Asp Ser Ala Pro Glu Gly Ser 
155 160 165 

GGG CCT GAA CCT ATG ACT GCT GCT GAC CCC ACT CCT GAG GCC TCT GTC 578 
Gly Pro Glu Pro Met Thr Ala Ala Asp Pro Thr Pro Glu Ala Ser Val 
170 175 i 180 185 

CCT GAA CCT AAG GCC TCC GGT ACC AAC CCT GAG ACC TCT GGC CTT GTC 626 
Pro Glu Pro Lys Ala, Ser Gly Thr Asn Pro Glu Thr Ser Gly Leu Val 

190 195 200 

CCA GAG GTC TCT GTC QCT GAC CCT GAT GCA CCA ACT GAG TCC CTG GCT 674 
Pro Glu Val Ser Val Pro Asp Pro Asp Ala Pro Thr Glu Ser Leu Ala 

205 210 215 

i 

GGA TCC TCC ACA GAG AGA GAC TTT GCC GTG GAC TTT GAG AAG ATC TAC 722 
Gly Ser Ser Thr Glu Arg Asp Phe Ala Val Asp Phe Glu Lys lie Tyr 

220 225 1 230 

i . 

AAA TAC CTG TCC TCC TCC TCC AGA GGT GGC CAT GGC CCT GAG CTC TCA 770 
Lys Tyr Leu Ser Ser Ser Ser Arg Gly Gly His Gly Pro Glu Leu Ser 
235 240 245 

GCA GTT GAG TCC GCC GTG GTC CTT AAC CTG CTC ATG TCA CTT GCA GAG 818 
Ala Val Glu Ser Ala Val Val Leu Asn Leu Leu Met Ser Leu Ala Glu 
250 255 260 265 

GAA CTG CCC CAC CTG CCT TGC ACA GCC CTG GTG GAC CAT CTG ACT AAA 866 
Glu Leu Pro His Leu Pro Cys Thr Ala Leu Val Asp His Leu Thr Lys 

270 275 280 

ACA TAT GCT CAA CTG ACG GCT CCC CAG GCC TCT CTC GAT AGA GAA AAG 914 
Thr Tyr Ala Gin Leu Thr Ala Pro Gin Ala Ser Leu Asp Arg Glu Lys 

285 290 295 

AGG CCC AGG CCG GGG ACT GAA GAT GGA GGG ACT GAC TCC ACG GGG CCA 962 
Arg Pro Arg Pro Gly Thr Glu Asp Gly Gly Thr Asp Ser Thr Gly Pro 
300 305 310 

GAA GAG ACT GGC CAA GGC AGT CCT CAA GCT TCT GAG CCC ACA GAA CCG 1010 
Glu Glu Thr Gly Gin Gly Ser Pro Gin Ala Ser Glu Pro Thr Glu Pro 
315 320 325 

AGG CTG ACC TGG AAA GCA GCT GGG GTC TGC CCA CTG AAC CCA TTC CTG 1058 
Arg Leu Thr Trp Lys Ala Ala Gly Val Cys Pro Leu Asn Pro Phe Leu 
330 335 340 345 

GTG CCC CTG GAC CTC CTG AGC CAG GCC CTC GCC CCT GCC AGG 1100 
Val Pro Leu Asp Leu Leu Ser Gin Ala Leu Ala Pro Ala Arg 

350 355 

TGAAGGATGC AGCAGGCTGG CTATGCTGGT TTCTTTCTAC ATCGTCTGTC CAGCACTGGT 1160 

ACTGGTGTCG GGAACGTGAT TGTGGCTTCC TGACAGTGTC CGGCCATAGG CAAGCCCTTC 1220 
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AGACACTGCA TGCACTAGTA GCCTGGAAAG TGAAGCACAG TTGAAGGAAG GGGTTTAACT 1280 

ATTGGGGATT TGAGACTGCA CAAAGGCGGA ATTCCTGGGC TGGTAGATTG CGGCAAGTGT 1340 

CCCTGGCCCC GCTGTGCCTG GTTCTCTGCC ATAAGAGTAC CACATCATCA TCCCTTTGTG 1400 

CATGGCACTC GAACCCACCC CCAGCACACC CTCCCCACCT CAACTGACCA CTGTACCATC 1460 

CTCTTCGCCC TTCTCTGGGT GTCCTGTGAC CCTCAATAAA GCTGTCATCT AACTAGCTGT 1520 

GAAAAAAAAA AAGTCGAC 1538 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 359 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Met Lys Pro Pro Gin Arg Arg Arg Lys Val Pro Ala Arg Tyr Thr Gly 
15 10 15 

Glu Ala Thr Gly Pro Thr Ala Trp Ser Pro Arg Glu Met Arg His Leu 

20 25 30 

Leu Arg Leu Leu Gin Ala Arg Arg Gly Gin Pro Glu Pro Asp Ala Lys 
35 40 45 

Glu Leu Ala Lys Glu Leu Arg Gly Arg Ser Glu Ala Glu lie Cys Arg 
50 55 60 

Phe lie Gin Gin Leu Lys Gly Arg Val Val Arg Glu Ala lie Arg Lys 
65 70 75 80 

Val Gin Pro Gly Gly Arg Asp Gly Pro Arg His Arg Glu Thr Gin Leu 

85 90 95 

Pro Ala Pro lie Glu Val Trp Met Asp Leu Ala Glu Lys Leu Thr Gly 

100 105 110 

Pro Leu Glu Glu Ala Leu Thr Ala Ala Phe Ser Gin Val Leu Thr lie 
115 120 125 

Ala Ala Ala Glu Pro Leu Ser Leu Leu His Ser Lys Pro Gly Lys Pro 
130 135 140 

Thr Lys Ala Cys Gly Arg Ala Leu Val Phe Leu Ser Thr Gin Asp Ala 
145 150 155 160 

Gin Lys Asp Ser Ala Pro Glu Gly Ser Gly Pro Glu Pro Met Thr Ala 

165 170 175 

Ala Asp Pro Thr Pro Glu Ala Ser Val Pro Glu Pro Lys Ala Ser Gly 

180 185 190 

Thr Asn Pro Glu Thr Ser Gly Leu Val Pro Glu Val Ser Val Pro Asp 
195 200 205 
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Pro Asp Ala Pro Thr Glu Ser Leu Ala Gly Ser Ser Thr Glu Arg Asp 
210 215 220 

Phe Ala Val Asp Phe Glu Lys He Tyr Lys Tyr Leu Ser Ser Ser Ser 
225 230 235 240 

Arg Gly Gly His Gly Pro Glu Leu Ser Ala Val Glu Ser Ala Val Val 

245 250 255 

Leu Asn Leu Leu Met Ser Leu Ala Glu Glu Leu Pro His Leu Pro Cys 

260 265 270 

Thr Ala Leu Val Asp His Leu Thr Lys Thr Tyr Ala Gin Leu Thr Ala 
275 280 285 

Pro Gin Ala Ser Leu Asp Arg Glu Lys Arg Pro Arg Pro Gly Thr Glu 
290 295 300 

Asp Gly Gly Thr Asp Ser Thr Gly Pro Glu Glu Thr Gly Gin Gly Ser 
305 310 315 320 

Pro Gin Ala Ser Glu Pro Thr Glu Pro Arg Leu Thr Trp Lys Ala Ala 

325 330 335 

Gly Val Cys Pro Leu Asn Pro Phe Leu Val Pro Leu Asp Leu Leu Ser 

340 345 350 

Gin Ala Leu Ala Pro Ala Arg 
355 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 2277 base pairs 
(B) TYPE: nucleic acid 
{C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW045 

(ix> FEATURE: 

<A) NAME /KEY: CDS 

(B) LOCATION: 9.. 1628 

(ix) FEATURE: 

(A) NAME /KEY: misc_feature 

(B) LOCATION: 1973.. 2134 

(D) OTHER INFORMATION: /label* SAC 24028 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

GTCGACCA GGG CGT GAT GGC TCC TAC TAC CAG CTC TTT GTT CAT GGT GGC 50 
Gly Arg Asp Gly Ser Tyr Tyr Gin Leu Phe Val His Gly Gly 
15 10 
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CGC CTC CAG CCA GTC CTA AGG CAG AAA GCC TGC CGA GGC ATG AAC TGG 
Arg Leu Gin Pro Val Leu Arg Gin Lys Ala Cys Arg Gly Met Asn Trp 
15 20 25 30 

ATA GCT GGG CTT CGG ATG GTG CCC GAT GGC AGT ATG GTC GTC TTG GGT 
He Ala Gly Leu Arg Met Val Pro Asp Gly Ser Met Val Val Leu Gly 

35 40 45 

TTC CAT GCC AAC GAG TTT GTA GTG TGG AGC CCG CGG TCC CAT GAG AAG 
Phe His Ala Asn Glu Phe Val Val Trp Ser Pro Arg Ser His Glu Lys 

50 55 60 



98 



146 



194 



CTG CAC ATC GTC AAC TGC GGG GGA GGG CAC CGC TCC TGG GCC TTT TCT 
Leu His He Val Asn Cys Gly Gly Gly His Arg Ser Trp Ala Phe Ser 
65 70 75 



242 



GAT ACT GAG GCA GCC ATG GCC TTT GCC TAC CTT AAG GAT GGT GAT GTC 
Asp Thr Glu Ala Ala Met Ala Phe Ala Tyr Leu Lys Asp Gly Asp Val 
80 85 90 

ATG CTC TAC CGG GCT CTA GGA GGC TGC ATC AGG CCA AAC GTG ATT CTC 
Met Leu Tyr Arg Ala Leu Gly Gly Cys He Arg Pro Asn Val He Leu 
95 100 105 110 

CGG GAG GGT CTG CAT GGC CGG GAA ATC ACA TGT GTA AAG CGT GTG GGC 
Arg Glu Gly Leu His Gly Arg Glu He Thr Cys Val Lys Arg Val Gly 

115 120 125 



290 



338 



386 



ACT GTT ACC CTG GGC CCT GAA TTT GAG GTA CCC AAC TTG GAG CAT CCT 
Thr Val Thr Leu Gly Pro Glu Phe Glu Val Pro Asn Leu Glu His Pro 

130 135 140 

GAC TCC CTG GAG CCT GGC AGT GAG GGG CCT GGT CTG ATT GAC ATC GTG 
Asp Ser Leu Glu Pro Gly Ser Glu Gly Pro Gly Leu He Asp lie Val 
145 150 155 

ATA ACA GGC AGT GAG GAC ACT ACT GTC TGT GTG CTA GCA CTT CCC ACC 
He Thr Gly Ser Glu Asp Thr Thr Val Cys Val Leu Ala Leu Pro Thr 
160 165 170 

ACC ACA GGC GCA GCC CAC GCC CTC ACT GCT GTC TGT AAC CAT ATC TCC 
Thr Thr Gly Ala Ala His Ala Leu Thr Ala Val Cys Asn His He Ser 
175 180 185 190 

TCC GTG CGA GCC CTG GCA GTG TGG GGT GTT GGC ACC CCA GGT GGC CCA 
Ser Val Arg Ala Leu Ala Val Trp Gly Val Gly Thr Pro Gly Gly Pro 

195 200 205 

CAG GAT TCT CAC CCA GGG CTC ACG GCT CAG GTA GTG TCT GCA GGG GGT 
Gin Asp Ser His Pro Gly Leu Thr Ala Gin Val Val Ser Ala Gly Gly 

210 215 220 



434 



482 



530 



578 



626 



674 



CGA GCC GAG ATG CAC TGC TTC AGC CTC ATG ATC ACT CCG GAT GCC AGC 

Arg Ala Glu Met His Cys Phe Ser Leu Met He Thr Pro Asp Ala Ser 
225 230 235 

ACC CCA AGC CGC CTT GCC TGC CAC ATC ATG CAC CTT TCA TCC CAC CGG 
Thr Pro Ser Arg Leu Ala Cys His He Met His Leu Ser Ser His Arg 
240 245 250 

CTA GAT GAG TAC TGG GAC CGG CAG CGC AAC CGG CAC CGG ATG ATC AAG 
Leu Asp Glu Tyr Trp Asp Arg Gin Arg Asn Arg His Arg Met He Lys 
255 260 265 270 



722 



770 



818 



-97- 



WO 98/53071 



PCT/US98/10547 



GTG GAC CCT GAG ACC AGG TAC ATG TCT CTT GCC ATT TGT GAG CTT GAC 866 
Val Asp Pro Glu Thr Arg Tyr Met Ser Leu Ala He Cys Glu Leu Asp 

275 280 285 

TCC GAT AGG CCT GGC CTT GGC CCC GGC CCC CTT GTG GCT GCA GCC TGT 914 
Ser Asp Arg Pro Gly Leu Gly Pro Gly Pro Leu Val Ala Ala Ala Cys 

290 295 300 

AGT GAC GGG GCA GTG AGG CTT TTT CTC TTG CAG GAC TCT GGG CGA ATT 962 
Ser Asp Gly Ala Val Arg Leu Phe Leu Leu Gin Asp Ser Gly Arg He 
305 310 315 

CTG CAT CTC CTT GCT GAG ACT TTC CAC CAT AAG ■ CGG TGT GTC CTC AAG 1010 
Leu His Leu Leu Ala Glu Thr Phe His His Lys Arg Cys Val Leu Lys 
320 325 330 

GTC CAT TCG TTC ACA CAT GAG GCA CCC AAC CAG CGT CGG AGG CTG ATC 1058 
Val His Ser Phe Thr His Glu Ala Pro Asn Gin Arg Arg Arg Leu He 
335 340 345 350 

CTG TGC AGT GCA GCT ACA GAT GGC AGC ATA GCC TTC TGG GAT CTC ACC 1106 
Leu Cys Ser Ala Ala Thr Asp Gly Ser He Ala Phe Trp Asp Leu Thr 

355 360 365 

ACG GCA ATG GAC CGA GGC TCT ACT ACC CTG GAG CCT CCA GCA CAC CCT 1154 
Thr Ala Met Asp Arg Gly Ser Thr Thr Leu Glu Pro Pro Ala His Pro 

370 1 375 380 

GGG CTT CCC TAC CAG ATG GGC ACC CCC TGC CTG ACC GTC CAG GCC CAT 1202 
Gly Leu Pro Tyr Gin Met Gly Thr Pro Cys Leu Thr Val Gin Ala His 
385 390 395 

AGC TGT GGT GTC AAC AGC CTG CAC ACT TTG CCT ACA CCT GAG GGC CAT 1250 
Ser Cys Gly Val Asn Ser Leu His Thr Leu Pro Thr Pro Glu Gly His 
400 405 410 

CAT CTT GTG GCC AGT GGC AGT GAG GAC GGG TCC CTG CAT GTC TTC ACG 1298 
His Leu Val Ala Ser Gly Ser Glu Asp Gly Ser Leu His Val Phe Thr 

415 420 425 430 

CTT GCT GTG AAG ATG CCA GAG CTG GAA GAG GCT GAT GGG GAG GCT GAG 1346 
Leu Ala Val Lys Met Pro Glu Leu Glu Glu Ala Asp Gly Glu Ala Glu 

435 440 445 

TTG GTG CCC CAG TTA TGT GTC CTA GAT GAA TAT TCT GTC CCC TGT GCA 1394 
Leu Val Pro Gin Leu Cys Val Leu Asp Glu Tyr Ser Val Pro Cys Ala 

450 455 460 

CAT GCT GCC CAT GTG ACG GGC ATC AAG ATC CTA AGT CCC AAG CTG ATG 1442 
His Ala Ala His Val Thr Gly He Lys He Leu Ser Pro Lys Leu Met 
465 470 475 

GTC TCA GCC TCC ATA GAC CAA CGG CTG ACC TTC TGG CGT CTT GGA AAT 1490 
Val Ser Ala Ser He Asp Gin Arg Leu Thr Phe Trp Arg Leu Gly Asn 
480 485 490 

GGT GAG CCC ACC TTC ATG AAT AGC ACT GTG TAC CAC GTG CCA GAT GTG 1538 
Gly Glu Pro Thr Phe Met Asn Ser Thr Val Tyr His Val Pro Asp Val 
495 500 505 510 

GCT GAC ATG GAC TGC TGG CCT GTG AAC CCT GAG TTT GGC CAC CGC TGT 1586 
Ala Asp Met Asp Cys Trp Pro Val Asn Pro Glu Phe Gly His Arg Cys 
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515 520 525 

GCT CTT GCG GGT CAG GGA CTC GAG GTT TAC AAC TGG TAT GAC 1628 
Ala Leu Ala Gly Gin Gly Leu Glu Val Tyr Asn Trp Tyr Asp 

530 535 540 

TGAGTTACCA CAGTAGTTGG AGGACTGAGC CTGGGGCTTG ACCACAGACA GCGGAGCAGG 1688 

GATCAGCTGT CTGTGTCATG CTCTATGTGC TCTGAGGAGG TGAGGCAGTA CCATGGGTTC 1748 

CTGTAGGGTG TTGCATAGGT GGACCAAGAA TATGCCTCAC TCTTCACAAT AGGATGAAAC 1808 

TGTATTTATT CTGACTTTAA GTGCCCAACA TCTGTGAGGT TTCTGTGAGG TCTTGTTTTT 1868 

TTCCCAGTTG ATGCTTTTAT AAACATTCCC AGCTATTGGG CCCTTAGATG TGGCTCAGCG 1928 

GAGGGAGGCC CAGCATGGCC AAGCCTGTGT GGAACACCTC ACGTACTGCC CTCAAAAGCT 1988 

GTAGGCGAGC AAACATCTGA CCAAAGAGGT GTGGCCGAGG TTCCTGAAAG AAAAGCAACC 2048 

AGGCCCATCC TCATTTCCCG AGCCTGAGCC CTTGTCCATA TTTCCCACAG ACCCTCCCTT 2108 

GCTGTATGCT CACCCCTAGA ATGTGTACGC GGTTATAGTA TGAGCTGAAA TCCATGCTGA 2168 

GCTGCACCAG GAACTTGCAC ACCTAGAGAC AGAGGTTGGA TCGTTGAGCT GTTGTCTTTT 2228 

TTCTTGTGTC ACAACCCAGA ATAAAGAATA GTGTGTGTAG TGTCAAAAA 2277 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Gly Arg Asp Gly Ser Tyr Tyr Gin Leu Phe Val His Gly Gly Arg Leu 
15 10 15 

Gin Pro Val Leu Arg Gin Lys Ala Cys Arg Gly Met Asn Trp lie Ala 

20 25 30 

Gly Leu Arg Met Val Pro Asp Gly Ser Met Val Val Leu Gly Phe His 
35 40 45 

Ala Asn Glu Phe Val Val Trp Ser Pro Arg Ser His Glu Lys Leu His 
50 55 60 

lie Val Asn Cys Gly Gly Gly His Arg Ser Trp Ala Phe Ser Asp Thr 
65 70 75 80 

Glu Ala Ala Met Ala Phe Ala Tyr Leu Lys Asp Gly Asp Val Met Leu 

85 90 95 

Tyr Arg Ala Leu Gly Gly Cys lie Arg Pro Asn Val lie Leu Arg Glu 

100 105 110 

Gly Leu His Gly Arg Glu He Thr Cys Val Lys Arg Val Gly Thr Val 
115 120 125 
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Thr Leu Gly Pro Glu Phe Glu Val Pro Asn Leu Glu His Pro Asp Ser 
130 135 140 

Leu Glu Pro Gly Ser Glu Gly Pro Gly Leu lie Asp lie Val lie Thr 
145 150 155 160 

Gly Ser Glu Asp Thr Thr Val Cys Val Leu Ala Leu Pro Thr Thr Thr 

165 170 175 

Gly Ala Ala His Ala Leu Thr Ala Val Cys Asn His lie Ser Ser Val 

180 165 190 

Arg Ala Leu Ala Val Trp Gly Val Gly Thr Pro Gly Gly Pro Gin Asp 
195 200 205 

Ser His Pro Gly Leu Thr Ala Gin Val Val Ser Ala Gly Gly Arg Ala 
210 215 220 

Glu Met His Cys Phe Ser Leu Met lie Thr Pro Asp Ala Ser Thr Pro 
225 230 235 240 

Ser Arg Leu Ala Cys His lie Met His Leu Ser Ser His Arg Leu Asp 

245 250 255 

Glu Tyr Trp Asp Arg Gin Arg Asn Arg His Arg Met lie Lys Val Asp 

260 265 270 

Pro Glu Thr Arg Tyr Met Ser Leu Ala lie Cys Glu Leu Asp Ser Asp 
275 280 285 

Arg Pro Gly Leu Gly Pro Gly Pro Leu Val Ala Ala Ala Cys Ser Asp 
290 295 300 

Gly Ala Val Arg Leu Phe Leu Leu Gin Asp Ser Gly Arg lie Leu His 
305 310 315 320 

Leu Leu Ala Glu Thr Phe His His Lys Arg Cys Val Leu Lys Val His 

325 330 335 

Ser Phe Thr His Glu Ala Pro Asn Gin Arg Arg Arg Leu lie Leu Cys 

340 345 350 

Ser Ala Ala Thr Asp Gly Ser He Ala Phe Trp Asp Leu Thr Thr Ala 
355 360 365 

Met Asp Arg Gly Ser Thr Thr Leu Glu Pro Pro Ala His Pro Gly Leu 
370 375 380 

Pro Tyr Gin Met Gly Thr Pro Cys Leu Thr Val Gin Ala His Ser Cys 
385 390 395 400 

Gly Val Asn Ser Leu His Thr Leu Pro Thr Pro Glu Gly His His Leu 

405 410 415 

Val Ala Ser Gly Ser Glu Asp Gly Ser Leu His Val Phe Thr Leu Ala 

420 425 430 

Val Lys Met Pro Glu Leu Glu Glu Ala Asp Gly Glu Ala Glu Leu Val 
435 440 445 



Pro Gin Leu Cys Val Leu Asp Glu Tyr Ser Val Pro Cys Ala His Ala 
450 455 460 
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Ala His Val Thr Gly lie Lys He Leu Ser Pro Lys Leu Met Val S r 
465 470 475 480 

Ala Ser He Asp Gin Arg Leu Thr Phe Trp Arg Leu Gly Asn Gly Glu 

485 490 495 

Pro Thr Phe Met Asn Ser Thr Val Tyr His Val Pro Asp Val Ala Asp 

500 505 510 

Met Asp Cys Trp Pro Val Asn Pro Glu Phe Gly His Arg Cys Ala Leu 
515 520 525 

Ala Gly Gin Gly Leu Glu Val Tyr Asn Trp Tyr Asp 
530 535 540 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2373 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double t 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW046 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 9.. 1160 

(ix) FEATURE: 

(A) NAME /KEY: misc„f eature 

(B) LOCATION: complement (1523.. 1760) 
(D) OTHER INFORMATION : /label* SACJ24036 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

GTCGACGT GAC TTC CTT GTC AGG TTC ACC CCC GCT GCA GCA GGG CTC TGC 50 
Asp Phe Leu Val Arg Phe Thr Pro Ala Ala Ala Gly Leu Cys 
15 10 

TCT CTG ACT GAG TGG TGC TGT GTT ACA TGG ATC TGT CAT TTC TAT TCA 98 
Ser Leu Thr Glu Trp Cys Cys Val Thr Trp He Cys His Phe Tyr Ser 
15 20 25 30 

GTC ATC AGT TGG CAT TGC AGT AAC ACC TAT GGC AAT GAA GAA AGA GAC 146 
Val He Ser Trp His Cys Ser Asn Thr Tyr Gly Asn Glu Glu Arg Asp 

35 40 45 

AGA ATT TGG GAT TTT CAT CTA AGG AAA ATG ATA AAT AAC AAC CAG ACG 194 
Arg He Trp Asp Phe His Leu Arg Lys Met He Asn Asn Asn Gin Thr 

50 55 60 

TGT GCT GCA GGG CAG GAC TCC GTG CCC TAC GTG ACC TGT ATG CTT CAC 242 
Cys Ala Ala Gly Gin Asp Ser Val Pro Tyr Val Thr Cys Met Leu His 
65 70 75 
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GTG CTG GAA GAG TGG TTG GGT GTG GAA CAG TTG GAG GAC TAG CTG AAT 290 
Val Leu Glu Glu Trp Leu Gly Val Glu Gin Leu Glu Asp Tyr Leu Asn 
80 85 90 

TTT GCG AAC CAT CTC TTG TGG GTC TTC ACC CCG CTG ATC CTT TTA ATA 338 
Phe Ala Asn His Leu Leu Trp Val Phe Thr Pro Leu He Leu Leu He 
95 100 105 110 

CTT CCG TAG TTT ACC ATC TTC CTT CTC TAC CTT ACT ATT ATT TTC CTC 386 
Leu Pro Tyr Phe Thr He Phe Leu Leu Tyr Leu Thr He He Phe Leu 

115 120 125 

CAC ATC TAT AAG AGG AAG AAT GTG TTA AAA GAA GCC TAC TCT CAC AAC 434 
His He Tyr Lys Arg Lys Asn Val Leu Lys Glu Ala Tyr Ser His Asn 

130 135 140 

TTG TGG GAT GGC GCG AGG AAA ACA GTG GCA ACC CTG TGG GAT GGA CAT 482 
Leu Trp Asp Gly Ala Arg Lys Thr Val Ala Thr Leu Trp Asp Gly His 
145 150 155 

GCG GCG GTT TGG CAT GGT TAT GAA GTT CAC GGG ATG GAA AAG ATA CCA 530 
Ala Ala Val Trp His Gly Tyr Glu Val His Gly Met Glu Lys He Pro 
160 165 170 

GAA GGA CCA GCA CTT ATA ATT TTT TAT CAT GGA GCT ATT CCC ATA GAC 578 
Glu Gly Pro Ala Leu He He Phe Tyr His Gly Ala He Pro He Asp 
175 1 180 185 190 

TTT TAC TAC TTC ATG GCT AAA ATT TTC ATC CAC AAA GGC AGA ACT TGC 626 
Phe Tyr Tyr Phe Met Ala Lys He Phe He His Lys Gly Arg Thr Cys 

195 200 205 

CGA GTG GTA GCT GAC CAC TTT GTC TTT AAA ATC CCA GGG TTC AGT TTA 674 
Arg Val Val Ala Asp His Phe Val Phe Lys He Pro Gly Phe Ser Leu 

210 215 220 

TTA CTT GAT GTA TTT TGT GCT CTT CAT GGA CCA AGA GAA AAA TGT GTT 722 
Leu Leu Asp Val Phe Cys Ala Leu His Gly Pro Arg Glu Lys Cys Val 

225 230 235 

GAA ATC TTG AGG AGT GGT CAC TTG TTA GCT ATT TCA CCG GGT GGA GTT 770 
Glu He Leu Arg Ser Gly His Leu Leu Ala He Ser Pro Gly Gly Val 
240 245 250 

CGA GAA GCC TTA CTT AGT GAT GAA ACC TAC AAC ATC ATA TGG GGT AAT 818 
Arg Glu Ala Leu Leu Ser Asp Glu Thr Tyr Asn He He Trp Gly Asn 
255 260 265 270 

CGT AAA GGC TTT GCT CAG GTT GCA ATC GAT GCA AAA GTG CCC ATT ATT 866 
Arg Lys Gly Phe Ala Gin Val Ala He Asp Ala Lys Val Pro He He 

275 280 285 

CCT ATG TTT ACA CAA AAC ATC CGA GAA GGA TTT AGA TCA CTC GGA GGA 914 
Pro Met Phe Thr Gin Asn He Arg Glu Gly Phe Arg Ser Leu Gly Gly 

290 295 300 

ACA AGA TTG TTT AAA TGG CTT TAT GAA AAA TTC CGC TAT CCG TTT GCT 962 
Thr Arg Leu Phe Lys Trp Leu Tyr Glu Lys Phe Arg Tyr Pro Phe Ala 
305 310 315 

CCA ATG TAT GGA GGT TTT CCT GTG AAG TTG CGG ACC TTC TTG GGT GAT 1010 
Pro Met Tyr Gly Gly Phe Pro Val Lys Leu Arg Thr Phe Leu Gly Asp 
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320 325 330 

CCT ATT CCG TAT GAC CCA GAG GTA ACA GCA GAA GAA TTA GCT GAA AAG 1058 
Pro He Pro Tyr Asp Pro Glu Val Thr Ala Glu Glu Leu Ala Glu Lys 
335 340 345 350 

ACT AAG AAC GCT GTT CAA GCT TTG ATC GAC AAG CAC CAA AGG ATA CCG 1106 
Thr Lys Asn Ala Val Gin Ala Leu He Asp Lys His Gin Arg He Pro 

355 360 365 

GGG AAC ATT AGG AGT GCT TTG CTG GAC CGC TTT CAT AAA GAG CAG AAG 1154 
Gly Asn He Arg Ser Ala Leu Leu Asp Arg Phe His Lys Glu Gin Lys 

370 375 380 

GCT AAT TGACAGATAA TTTCGTGTGT CTCTAATGCC AAGTTTGCAT CTGTGGTACT 1210 
Ala Asn 



GCCTTTTAAA 


CTTTTGTAGG 


TTGTATAATA 


GTCTTTTTAA 


AACCTGCTAA 


TGAGCATCTT 


1270 


TCTTTAGAAT 


TTAAACTTTC 


TATGGTTAAT 


CTTACCCTTT 


CAATCAGTTT 


TGTCAAATGT 


1330 


AGTAAGTAAC 


TCACCATTTG 


CTTGACTCAG 


AAGATGATCA 


TGTTGCATTT 


GCAATCCTTA 


1390 


GTAGTATATG 


ATAGGCACAA 


GTCTTCCTGG 


ATAACAGCAG 


TCTAGGTTAG 


GCAGACTTCA 


1450 


CTAAATTATG 


CTTCTGTGTT 


TGTGTCTGTT 


GTGCCAATGG 


GATGGTGAGA 


GGTACCTGAG 


1510 


CTGGGGTGTA 


GCACTTGCTT 


AGTTTAGGTG 


AGCGGTGCCT 


t TTGTATCCAT 


CCTTCATTCC 


1570 


ACTCACAACT 


GCTGGTAAGT 


TGTGGAACGC 


TTTTGCACCT 


CCGTGGTTTA 


TGTTAAGCAT 


1630 


GGTAATACTG 


TGGTGGTGAA 


CATTAAGTTG 


CTTATGTCTA 


CTGTGATATA 


TCCGTATCTA 


1690 


TATATCTGAT 


AGATATCTGT 


AGTGAGAAAT 


CTGTAATTGA 


ATTGATTCCT 


GAATCCTTTA 


1750 


GGAGTTATGT 


ACATTTTTGT 


CCATTGGGTA 


TTGAAATTAC 


TCCTTCAGGG 


ATGATGTCCA 


1810 


GTTGCCAAGC 


ACAATGAAAG 


AGATGTACTA 


TTTTGTAAAT 


CAAATCATTG 


TTACTTGTAG 


1870 


TATTTATTAA 


CAGAGTTGTT 


TGGTATTTAT 


TTTAAACAAG 


GATCTAAGTA 


AATGTATAGG 


1930 


ATGGATTGTC 


AGCACGTGTC 


TTACACGGTG 


TAAAGATAAA 


ATCGTGGGCA 


CGCTCTGCAG 


1990 


TTTATGTTAG 


AGCAGATCCT 


AGGCTGAGGA 


GTGACAACTG 


AGGAAGCTCT 


GCCTCGCTTA 


2050 


TCCCAGGGCT 


CGCTTCCGTG 


CTCTGCAGCC 


AGTCACGTGG 


TTATATCTGT 


AAAATGTACT 


2110 


TGAATGATGA 


ATTTCTGTAA 


ACATGGAGTC 


CTCTGAAAAT 


GTAATTACTT 


TGAATTGTTG 


2170 


TCTTCTTGAA 


AACTTATTTT 


AAAAGTATTT 


AAACAGCCAA 


GTATCATGAA 


TTCATCGCAG 


2230 


TGTGTCTTCC 


CCCTTTGCAG 


TACTTGGGTT 


TTATATGGAC 


TTTTTAACTA 


CAAGTGTATA 


2290 


TTTGTGATAT 


GTATAATGTA 


TGTATAATGT 


ATATAGAAAT 


GGTGTGAAAA 


TAAAGCTGAA 


2350 


CTGCAACTCC 


TCTCTAGTAA 


AAA 








2373 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 384 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Asp Phe Leu Val Arg Phe Thr Pro Ala Ala Ala Gly Leu Cys Ser Leu 
15 10 15 

Thr Glu Trp Cys Cys Val Thr Trp lie Cys His Phe Tyr Ser Val He 

20 25 30 

Ser Trp His Cys Ser Asn Thr Tyr Gly Asn Glu Glu Arg Asp Arg He 
35 : 40 45 

Trp Asp Phe His Leu Arg Lys Met He Asn Asn Asn Gin Thr Cys Ala 
50 55 60 

Ala Gly Gin Asp Ser Val Pro Tyr Val Thr Cys Met Leu His Val Leu 
65 70 75 80 

1 ■ i 

Glu Glu Trp Leu Gly Val Glu Gin Leu Glu Asp Tyr Leu Asn Phe Ala 

85 90 95 

Asn His Leu Leu Trp Val Phe Thr Pro Leu He Leu Leu He Leu Pro 

100 - 105 110 

Tyr Phe Thr He Phe Leu Leu Tyr Leu Thr He He Phe Leu His He 
115 120 125 

Tyr Lys Arg Lys Asn Val Leu Lys Glu Ala Tyr Ser His Asn Leu Trp 
130 135 140 

Asp Gly Ala Arg Lys' Thr Val Ala Thr Leu Trp Asp Gly His Ala Ala 
145 150 155 160 

Val Trp His Gly Tyr Glu Val His Gly Met Glu Lys He Pro Glu Gly 

165 170 175 

Pro Ala Leu He He Phe Tyr His Gly Ala He Pro He Asp Phe Tyr 

180 185 190 

Tyr Phe Met Ala Lys He Phe He His Lys Gly Arg Thr Cys Arg Val 
195 200 205 

Val Ala Asp His Phe Val Phe Lys He Pro Gly Phe Ser Leu Leu Leu 
210 215 220 

Asp Val Phe Cys Ala Leu His Gly Pro Arg Glu Lys Cys Val Glu He 
225 230 235 240 

Leu Arg Ser Gly His Leu Leu Ala He Ser Pro Gly Gly Val Arg Glu 

245 250 255 

Ala Leu Leu Ser Asp Glu Thr Tyr Asn He He Trp Gly Asn Arg Lys 

260 265 270 

Gly Phe Ala Gin Val Ala He Asp Ala Lys Val Pro He He Pro Met 
275 280 285 

Phe Thr Gin Asn He Arg Glu Gly Phe Arg Ser Leu Gly Gly Thr Arg 
290 295 300 
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Leu Phe Lys Trp Leu Tyr Glu Lys Phe Arg Tyr Pro Phe Ala Pro Met 
305 310 315 320 

Tyr Gly Gly Phe Pro Val Lys Leu Arg Thr Phe Leu Gly Asp Pro He 

325 330 335 

Pro Tyr Asp Pro Glu Val Thr Ala Glu Glu Leu Ala Glu Lys Thr Lys 

340 345 350 

Asn Ala Val Gin Ala Leu He Asp Lys His Gin Arg He Pro Gly Asn 
355 360 365 

He Arg Ser Ala Leu Leu Asp Arg Phe His Lys Glu Gin Lys Ala Asn 
370 375 ' 380 

(2) INFORMATION FOR SEQ ID NO: 39; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2241 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW047 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 8.. 511 

(ix) FEATURE: 

(A) NAME /KEY: misc^f eature 

(B) LOCATION: complement (1675.. 1833) 

(D) OTHER INFORMATION: /label= SAC.23915 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 

GTCGACC AGG GCC TTC CAT CAG ACC TGC CCA AAC TTT GTC ATA GAG AAG 49 
Arg Ala Phe His Gin Thr Cys Pro Asn Phe Val He Glu Lys 
15 10 

ATT GAA AGG ATT CAA AAT CCA GCT CTT TGG AGG AGG TAC CAG GCG TAC 97 
He Glu Arg He Gin Asn Pro Ala Leu Trp Arg Arg Tyr Gin Ala Tyr 
15 20 25 30 

AAA AAA ATT ATG GAT GAA AAG AAT GGC AAT GTG ATA AAT GAG AAG CAA 145 
Lys Lys He Met Asp Glu Lys Asn Gly Asn Val He Asn Glu Lys Gin 

35 40 45 

CTC TTC CAT GGG ACG GAG TTT GGC TCT CTG GCA CAG CTC AAC AGT AAT 193 
Leu Phe His Gly Thr Glu Phe Gly Ser Leu Ala Gin Leu Asn Ser Asn 

50 55 60 

GGA TTT AAC CGC AGC TAT GCT GGG AAG AAC GCT ACG GCT TAT GGG AAA 241 
Gly Phe Asn Arg Ser Tyr Ala Gly Lys Asn Ala Thr Ala Tyr Gly Lys 
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65 70 75 

GGA ACC TAT TTT GCA GTC AAT GCT TCC TAT TCT GCC CAT GAC ACA TAT 289 
Gly Thr Tyr Phe Ala Val Aan Ala Ser Tyr Ser Ala His Asp Thr Tyr 
80 85 90 

TCC AAA CCA GAT GCC AAT GGG AAA AAG TAT ATG TAT TAT GTG CGG GTC 337 
Ser Lys Pro Asp Ala Asn Gly Lys Lys Tyr Met Tyr Tyr Val Arg Val 
95 100 105 110 

CTC ACT GGA AAC TAC ACA CAA GGA AAC GCA TCA CTG ATT GTG CCT CCT 385 
Leu Thr Gly Asn Tyr Thr Gin Gly Asn Ala Ser Leu He Val Pro Pro 

115 120 125 

TCA AGG GAC CCT CAA AAT GCT ACT GAC TTG TTT GAC ACT GTC GCA GAT 433 
Ser Arg Asp Pro Gin Asn Ala Thr Asp Leu Phe Asp Thr Val Ala Asp 

130 135 140 

AAT GTT ATA AAT CCG AGT ATA TTT GTA GTG TTT TAT GAC AAT CAA GCA 481 
Asn Val He Asn Pro Ser He Phe Val Val Phe Tyr Asp Asn Gin Ala 
145 150 155 

TAC CCT GAG TAT CTT ATC ACA TTT AGG AAG TAACACTTTG GGGTGCCTTC 531 
Tyr Pro Glu Tyr Leu He Thr Phe Arg Lys 
160 165 



TGAAAGACAT 


TGGTATCTGC 


CGTAGGACAA 


ATTTTAGCTG 


TTTTCCTTCT 


TAGACTTTCT 


591 


TCTCTAAGAG 


CACGAGGTCT 


TTCTCTGCCA 


CGGATTGTTT 


CCTCAGCTGC 


CCCTTCATAG 


651 


TGAACATGAA 


CCTACTTGAG 


GGCCAACCAC .TTGAGAATTT 


CAAGCAGGTG 


ACAGCTTTAG 


711 


CTGCCTTCAC 


AAATGTGAAC 


CTCAAATCCA 


CGGAAAAGAA 


CAAACTTACG 


TTCTGGAAGG 


771 


ACAATAAAAC 


CTTTATGCAC 


TTTCAGAAAG 


CCCACAATAC 


ACCCACCACC 


AGTGGAGTTA 


831 


CATCAGAGCA 


CCACTGTGCT 


GAGGGGCACC 


CAGGACCACT 


GAGCCAGTAA 


GGCTGCCAAC 


891 


TGAACAGATT 


CCAAGACAGA 


GAACTTCGTT 


CATCACTAAC 


AGCGCCGTGT 


CCTGTGGGTT 


951 


GGACAGTTCC 


AGCCAGGGTT 


ATG TAC AG AC 


TGGGTTCTGG 


GATGTGGCTC 


TGGGTCTGTG 


1011 


TGAACTGAGC 


CCCCCAAGGA 


GAGGCACAGC 


TGAAGCCCTG 


TCAGGGGGCC 


CTTTTAATGT 


1071 


TGACACCAAT 


GTTGAGGCTA 


CCTAGTGAAG 


AAATGGGATG 


GGGCCTGCTA 


CCTCTCCTGT 


1131 


GGAAGTCCTT 


CTACTGGGTG 


AAAATGAATG 


AACTCTAACA 


GTATTGGCTA 


GCGTCTTTCT 


1191 


TGTGGCAGGC 


TGGCTTGTGA 


TAACTTGAAG 


CTTTATGACT 


AAGTCAGTCA 


TTTTGCTAAC 


1251 


TCACTAACAG 


TATCTACTGG 


ATGCGATTTG 


TTTCTTGTGA 


TGCCAGAGAT 


CACCCAGAGC 


1311 


CTCACATGGG 


CCAGGCAAGC 


ACTCTACTAC 


TGAACTATAT 


TCCCAGCCTC 


TTAGTGAGGC 


1371 


AAGACTCCAC 


TAAGTTGCCC 


TGGCAGGTTT 


GAACTCATTA 


TCTTCCTGCA 


TTGGCTACTT 


1431 


ATTGGGGTGG 


GGTGGGGTGC 


TTAAATTGTT 


CTTTTAATTT 


TCTCCTGTAC 


TCCCATTTAT 


1491 


TGAGTGCCTG 


GACCCTTGTC 


AGGTAGAATA 


AGAGGTGTGT 


TCACATAGAT 


AATTTTGTTT 


1551 


CCATAAAAAA 


AAATGCACTG 


TAAGAATTAT 


GTCAAGGCCT 


AGGAGCAGGC 


TCAGAGTACA 


1611 


AGAGCTGTCC 


TTGCCAAAGG 


TCACTAAGGC 


AGGAACCGCA 


GAGCCAGCCA 


CCAGAGTCTT 


1671 
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\3\S X X Uvu X 


TGAGCCCCTC 


ACTATTCCCA 


GCACAGGGCC 


TGGCCCTGAG 


GTAAGCCTGT 


1731 


fiPTRPTTPAG 


AAGTTGGCTT 


TTCTCCTTCT 


GGGATTGGTG 


TCCACTCTGC 


CCCTTTTCTA 


1791 




CTAGTGCGGT 


GTAATGTTCC 


CAGGAGGTAG 


GTACACTCTT AGGTGTTGTG 


1851 


fif3rtATPPAAG 


TGCTGGTTCC 


CACCATTGGC 


ATTAACAGTG 
n x x ruivoU x vi 


GACTGCAAGC 


AGAGAGAGGG 


1911 


rjmpppp*TPA 
Vj X ^.^>v* v^-ri X v^/a 


X XwX X XwWaw 


GACACTGAGG 


PC TTGAG APP 


TTGGCTGCTG 


TGTCTATGTG 


1971 


GACTGGCAAA 


GACTGTGGCT 


TGGGAGGGGC 


TTCCTCAGCA 


CTGGCACCTG 


AACTCAGCAC 


2031 


CAAGGATGGC 


CCCAAGAGGC 


TTGTTTTCAT 


TTGCCTTTGT 


TCTCCAGTGC 


TTTGGTTACC 


2091 


TCAGGGCAGG 


GCTGAACAAT 


AGTGTGATGT 


TACATTGAAG 


AATAAAGACC 


CAACCAACAA 


2151 


ATGTTAGTGT 


CATTTTCAGT 


CTAAGTAAAT 


AAGCCATTAT 


GTCATTTCCC 


TTACTTCCTA 


2211 


GTAAAGTTAT 


GTAATTACAT 


AATGTAAAAA 








2241 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 168 amino acids i 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Arg Ala Phe His Gin Thr Cys Pro Asn Phe Val lie Glu Lys lie Glu 
15 10 15 

Arg lie Gin Asn Pro Ala Leu Trp Arg Arg Tyr Gin Ala Tyr Lys Lys 

20 25 30 

lie Met Asp Glu Lys Asn Gly Asn Val lie Asn Glu Lys Gin Leu Phe 
35 40 45 

His Gly Thr Glu Phe Gly Ser Leu Ala Gin Leu Asn Ser Asn Gly Phe 
50 55 60 

Asn Arg Ser Tyr Ala Gly Lys ABn Ala Thr Ala Tyr Gly Lys Gly Thr 
65 70 75 80 

Tyr Phe Ala Val Asn Ala Ser Tyr Ser Ala His Asp Thr Tyr Ser Lys 

85 90 95 

Pro Asp Ala Asn Gly Lys Lys Tyr Met Tyr Tyr Val Arg Val Leu Thr 

100 105 110 

Gly Asn Tyr Thr Gin Gly Asn Ala Ser Leu lie Val Pro Pro Ser Arg 
115 120 125 

Asp Pro Gin Asn Ala Thr Asp Leu Phe Asp Thr Val Ala Asp Asn Val 
130 135 140 

He Asn Pro Ser He Phe Val Val Phe Tyr Asp Asn Gin Ala Tyr Pro 
145 150 155 160 
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Glu Tyx Leu lie Thr Phe Arg Lys 

165 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1918 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW049 

(ix) FEATURE: 

(A) NAME /KEY: misc_feature 

(B) LOCATION: complement (1289.. 1612) 
(D) OTHER INFORMATION: /label= SAC_24337 

^ 

I 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 



GTCGACGTGG 


TGATTATCTT 


AAAGGTAAGG 


GAGGTTTTTT 


TTTCCCTGAT 


ACTGATGAAG 


60 


GCTGAGTGTG 


ACTGTTTAGT 


ATAGCTAGTG 


GGCTTTGTCA 


CTTAAGCCTT 


CCTTAGCATG 


120 


TTCCCTTTCC 


CTTATTTTTC 


CTGGATACTT 


ACGTGCTAAC 


GTGCTAACAA 


CTCCAGATCA 


180 


AACTGAGTAT 


GGAGATTTTT 


GCTTTAACAA 


TTTAGGAGTG 


GCAAACAATT 


TATAACTGGC 


240 


AGGGTTTGAG 


GACAGCCGTG 


TTGAAAAACC 


AAAAGAAAAT 


GCTAAGGCAG 


GTCTTGACCA 


300 


ATGTGGATAC 


TTTGACTAAT 


TAACTTCGTT 


ACTTTAGGAG 


GAGGTTGTTC 


CTGCACGCCA 


360 


TCCATCTCCA 


GCGTTCGAAG 


GTTAGATTGT 


TGGGGTGGTG 


TGTCTTGTTC 


ACCAGATTGG 


420 


CTGTTTCTGT 


GATGAATCAA 


CTAGCTCACT 


ATGACTGCTA 


ATGAAAACAC 


TGGAACACCT 


480 


GAGAAACTGA 


AGGGTGGCGT 


GTTGGGGCAA 


GGGTGCTTAC 


ATATCAAGCC 


TTTTCAATAG 


540 


TTTGTTTTTC 


TCCAGGTTCT 


GTTAGCAGTG 


CTGTTACTTG 


GTGCTTCAGA 


GGTAAGTAAT 


600 


TGCAGCCAGC 


GTGCAGTGGT 


GTGCACACTG 


CTGGACTCAG 


GTCCTAAGTA 


CTTCATATAT 


660 


ATGATGACTT 


TCATAGAATC 


TCGTTCGGCT 


GATGATTGCT 


GTTGAGACTT 


GGAAATCTGA 


720 


TTTTTCTGAG 


GATGATGGGT 


TGGATGGGAA 


CAATCTGCCA 


GGATTTTTAA 


CATTGCCCAT 


780 


CTCGTGTGTT 


TCAGATATGG 


TGACTGGCCG 


AGCGCAAAGG 


AGCCATGCCA 


GGTAAATGTT 


840 


CTAGCAATAT 


TATTGTTTAT 


TAGGTAGATA 


CGTAATGGCT 


TGATCCAGTG 


CTTTCCTGCT 


900 


CACCCTGATG 


AACTGAATAC 


CGCCCCAGTC 


TGATAGCTGT 


GGAGAAAGGT 


ATTTTCTGAG 


960 


TTGAGCTCAT 


CTTGGATCAA 


ACAGGAAGGG 


TTTTATGCTA 


GTTAGTATAT 


GATCTCTGTA 


1020 
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GGATACGTTT AGGCAATAAG GATAATGACC CAAATTTATT TACTGTTTTG CAGGAAAACT 1080 

AAGAGCACAT CTTCATAAAG TCCAGGCTAA AGTAAGGTGA GAAAACTAAG ATGTAAAGTG 1140 

TACCAAGTTA GTAACTTCCC TATGAAGAAA CATTCACACG TCTTACTTCC TGTCCTAGCT 1200 

CCAGAGCCTG AAAAGGTGAA CCCACTGGGG CTGGCTGGGG GAAAAGAGGA AATTTTGTTC 1260 

CAGAAGGAAC TGTCTGAGGG ATCATTTGTA TGTGTGGAAA GGGAAGAATT AAACTTTGAT 1320 

TAAAAAGGAA ACAAGGTGGT TTGTCTGTTT TTCAGGTCTG AGGAGAGGGA TGAGGTGTGT 1380 

GTTAGAGGTA GGTAGCCCTT CAAGTCATTG TAATTAGGTG GGCAGGGTAG ATTTTGCTTT 1440 

CTGCTAATTA AAACTTTTAC CTTACAGGAA CTGAAATCAA AGATCCGATT ACCTAGGCCA 1500 

GTGGCACGGC AAACTTCAGA ACTCATGGAA TACAGCCTGT TGATTCCTTG TGGATTTTTA 1560 

GCATCTGAGC AGTATCAGTG GCACTCCAAG TGAAATGGTT GATCTCTTCA GTACCGCTGG 1620 

AGCAGATTAA TATTTATTTC ACTGCCAGAG GCTATATTTC TCTGCCCCCC CATACCCCCT 1680 

TCCTCTTCTC CTTAGACTAC TTAAGATTGA AATATTTGCA ACTCCAAATT TATCTTCCTT 1740 

GCTTACCACT TTTTTTTCCG AGCTGGGGAC CAAACCCAGG GCCTTGCGCT TCCTAGGTAA 1800 

GCGCTCTATC ACTGAGCTAA ATCCCCAGCC CCGCTTACCA CATTGTTAAT CTGATCCTTC 1860 

CGCAGTTATT TGTGCTGTTC TTTTTGTTCC TGGCAATAAA TGCTTTTTAT GCTAAAAA 1918 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH i 3942 base pairs 

(B) TYPE : 1 nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
<B) CLONE: HW050 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 7.. 1326 

(ix) FEATURE : 

(A) NAME /KEY: misc.feature 

(B) LOCATION: complement (3542.. 3733) 

(D) OTHER INFORMATION: /label= SAC_24644 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

GTCGAC ATA AAT ATG GCC GTC CGG GTA CCC CTC GGC TGC ACG GGC TCC 48 
lie Asn Met Ala Val Arg Val Pro Leu Gly Cys Thr Gly Ser 
15 10 

TTC TGC CCG CGG CTG CTA CCA CTG CTG GCA TTG TTG GAG CTG CTC GTC 96 
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Phe Cys Pro Arg Leu Leu Pro Leu Leu Ala Leu Leu Glu Leu Leu Val 
15 20 25 30 

GAC CCC AGC TTG GGC CGC GTC CAT CAC CTG GCG CTC AAA GAT GAC GTG 
Asp Pro Ser Leu Gly Arg Val His His Leu Ala Leu Lys Asp Asp Val 

35 40 45 

CGG CAC AAA GTC CAT CTC AAC ACC TTC GGC TTC TTC AAG GAC GGG TAC 
Arg His Lys Val His Leu Asn Thr Phe Gly Phe Phe Lys Asp Gly TVr 

50 55 60 

ATG GTG GTC AAC ATC AGC AGC CTC TCT GTG AAC GAG CCT GAG GGA GTC 
Met Val Val Asn lie Ser Ser Leu Ser Val Asn Glu Pro Glu Gly Val 
65 70 75 

AAG GAC AAA GAC ACT GAG ATT GGC TTC AGT CTT GAT CGG ACC AAG AAT 
Lys Asp Lys Asp Thr Glu He Gly Phe Ser Leu Asp Arg Thr Lys Asn 
80 85 90' 

GAT GGC TTT TCT TCT TAC CTG GAT GAA GAT GTG AAT TAC TGT ATT TTA 
Asp Gly Phe Ser Ser Tyr Leu Asp Glu Asp Val Asn Tyr Cys He Leu 
95 100 105 , HO 

AAA AAA AAG TCC ATG TCT TCT GTC ACT CTG CTC ATC TTA GAC ATC TCC 
Lys Lys Lys Ser Met Ser Ser Val Thr Leu Leu He Leu Asp He Ser 

115 120 , 125 

GGA AGT GGA GTC AAG GTC AGA TCC CCA CCA GAA GCT GGC AAG CAG TTA 
Gly Ser Gly Val Lys Val Arg Ser Pro Pro Glu Ala Gly Lys Gin Leu 

130 135 140 

CCC GAG ATT GTC TTC AGC AAG GAT GAG AAA GTC CCG AGT CGG AGC CAG 
Pro Glu He Val Phe Ser Lys Asp Glu Lys Val Pro Ser Arg Ser Gin 
145 150 155 

GAG CCT GCT GTC AGC TCC AAC CCC AAA GAC AGC AAG GTG CAG AGA ACC 
Glu Pro Ala Val Ser Ser Asn Pro Lys Asp Ser Lys Val Gin Arg Thr 
160 165 170 

CCT GAT GGT TCC AAG GCT CAA CGA AGT ACA GTG GAT TCA AAG ACT ATA 
Pro Asp Gly Ser Lys Ala Gin Arg Ser Thr Val Asp Ser Lys Thr He 
175 180 185 190 

GCA GAG AAA TTC TTC TCA ATT CAT AAG AAT GAT GGG GCA GTT TCA TTT 
Ala Glu Lys Phe Phe Ser He His Lys Asn Asp Gly Ala Val Ser Phe 

195 200 205 

CAG TTC TTC TTT AAC ATC AGC ACC AGT GAC CAG GAA GGC CTC TAC AGC 
Gin Phe Phe Phe Asn He Ser Thr Ser Asp Gin Glu Gly Leu Tyr Ser 

210 215 220 

CTT TAC TTC CAC AAG TGC CCA AGC AGC AAG CTG AGG TCT GGT GAG CAG 
Leu Tyr Phe His Lys Cys Pro Ser Ser Lys Leu Arg Ser Gly Glu Gin 
225 230 235 

GTC TCA TTC AGC CTA AAT ATT GAT ATC ACC GAG AAG AAT CCT GAC AGC 
Val Ser Phe Ser Leu Asn He Asp He Thr Glu Lys Asn Pro Asp Ser 
240 245 250 

TAC CTG TCT GCA GGG GAA ATC CCT CTC CCC AAG TTA TAC GTT TCC ATG 
Tyr Leu Ser Ala Gly Glu He Pro Leu Pro Lys Leu Tyr Val Ser Met 
255 260 265 270 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 
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GCC TTG TTG TTC TTC CTG TCT GGG ACC GTG TGG ATT CAC ATC CTT CGT 864 
Ala Leu Leu Phe Phe Leu Ser Gly Thr Val Trp He His He Leu Arg 

275 280 285 

AAA CGA AGG AAT GAT GTA TTT AAA ATT CAC TGG TTG ATG GCG GCC CTT 912 
Lys Arg Arg Asn Asp Val Phe Lys He His Trp Leu Met Ala Ala Leu 

290 295 300 

CCT TTC ACC AAG TCT CTC TCC TTG GTG TTC CAT GCA ATC GAC TAC CAC 960 
Pro Phe Thr Lys Ser Leu Ser Leu Val Phe His Ala He Asp Tyr His 
305 310 315 

TAC ATC TCC TCG CAG GGC TTT CCG ATT GAA GGC TGG GCT GTT GTG TAC 1008 
Tyr He Ser Ser Gin Gly Phe Pro He Glu Gly Trp Ala Val Val Tyr 
320 325 330 

TAC ATA ACT CAC CTG CTG AAG GGT GCA CTG CTG TTC ATC ACC ATC GCC 1056 
Tyr He Thr His Leu Leu Lys Gly Ala Leu Leu Phe He Thr He Ala 
335 340 345 350 

CTC ATT GGC ACT GGC TGG GCC TTC ATC AAG CAC ATC CTG TCT GAT AAG 1104 
Leu He Gly Thr Gly Trp Ala Phe He Lys His He Leu Ser Asp Lys 

355 360 365 

GAC AAG AAG ATC TTC ATG ATT GTC ATT CCG CTC CAG GTA CTG GCG AAT 1152 
Asp Lys Lys He Phe Met He Val He Pro Leu Gin Val Leu Ala Asn 

370 ( 375 380 

GTG GCC TAC ATC ATC ATA GAG TCT ACT GAG GAG GGC ACA ACT GAG TAT 1200 
Val Ala Tyr He He He Glu Ser Thr Glu Glu Gly Thr Thr Glu Tyr 
385 390 395 

GGC TTG TGG AAG GAC TCT CTA TTC CTG GTG GAT TTG CTG TGC TGC GGC 1248 
Gly Leu Trp Lys Asp Ser Leu Phe Leu Val Asp Leu Leu Cys Cys Gly 
400 405 410 

GCC ATC CTC TTC CCA GTG GTG TGG TCA ATC AGG CAT TTA CAA GAA GCC 1296 
Ala He Leu Phe Pro Val Val Trp Ser He Arg His Leu Gin Glu Ala 
415 420 425 430 

TCG GCC ACA GAT GGA AAG CTG CCA TTA ACC TAGCAAAGCT GAAGCTTTTC 1346 
Ser Ala Thr Asp Gly Lys Leu Pro Leu Thr 





435 




440 








AGACATTACT 


ACGTCCTGAT 


CGTGTGCTAC 


ATCTACTTCA 


CCAGGATCAT 


CGCCTTTCTT 


1406 


CTCAAGTTCG 


CTGTTCCCTT 


CCAGTGGAAG 


TGGCTCTACC 


AGCTGCTGGA 


TGAAACAGCC 


1466 


ACACTGGTGT 


TCTTTGTCCT 


GACGGGGTAT 


AAATTTCGCC 


CAGCTTCAGA 


TAACCCCTAC 


1526 


CTGCAGCTCT 


CTCAGGAAGA 


AGATGACCTG 


GAGATGGAGT 


CTGTAGTGAC 


GACGTCAGGA 


1586 


GTAATGGAGA 


ACATGAAGAA 


GGTGAAGAAA 


GTGAGCAATG 


GTGCTGTGGA 


GCCCCAGGGC 


1646 


AGCTGGGAAG 


GCACCGCGTG 


ACAGCGCAGC 


TGAGGACCGC 


AGGCCGTCAA 


TGAACCTAGC 


1706 


TTATCCATAG 


CCCTATCAGT 


GAGCACGCCT 


GTTCCTGACA 


GACGATGGCA 


TCTCCCAGCA 


1766 


GCGACACCCC 


GGCGCTCAGC 


ACCATGGCCG 


AGCCGAGTGC 


TATGGACACA 


CACTTTTGTA 


1826 


CTCTTAGGGA 


TCTGGATTGA 


GGTGGGCTAC 


AGGCAGTGAG 


GACTGCCCTC 


AGTGAGGGCG 


1886 


GAGGCTTAGA 


GGAGGAGTAA 


AGACTGTCTG 


TCTGATTTAC 


CTTGCTAATT 


GGGAGCCTGG 


1946 
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GTCTGTGCAG 


GGGAGCCCCC 


AGTGGCTGCT 


ACTCAGAAAG 


GCTGTGGGTG 


TGGCACGGGT 


2006 


CTGCCACTCG 


CTGGTCACAG 


TTCTGTGGAG 


GCTGGGAGGA 


TGTGACTCAC 


TGCTTGGCCT 


2066 


AGCATTTACG 


AGGCTTTGGG 


TTCAATTCCT 


AGTGCCAGGG 


TGGGGGGAAA AGAAGTTACT 


2126 


TTGGGACTAG 


TCAGGTATCT 


ACCAAGGGCA 


CAGAGAAGAG 


AAAGAGATTG 


TCAGCCTTTG 


2186 


GGAACAAGAA 


CAACAGAAAT 


GATTGAAATT 


CTTTTAAATA 


GCCAATAAAC 


CCCTAGGTGG 


2246 


CAGAAGACAC 


CTGTCCAGGG 


TCACTGGCCC 


CCAGGCACCA 


CTCTAAATGT 


CCCTGCCCCT 


2306 


GATGTGACCT 


ACAACAGTGG 


TGCTAGAGGC 


CAGAGGGAGG 


GAGGGAGGGG 


GCCTGTTACA 


2366 


TGGCTGGCTG 


CAGGGGAGAA 


GAGCCTTGGC 


TCTCCAGAAG 


TGTTCTCCAG 


TTGGCTCTCA 


2426 


GGACTCCATC 


CCCTGGCTGC 


CTGCCCAGAG 


CAGCCGCCTG 


GCTTCCTCTC 


CTTCCACCTG 


2486 


TGTCTTCTGA 


TCCGAGAAAA 


CAGTTTTCCT 


GAAGGAGCAG 


CCGAGGACAG 


AAAGGGTAGT 


2546 


CTCTAAACAA 


ACACTTAGGT 


TAAGCATCTT 


CCACAGCTCG 


CATGGGCCAG 


GCAGGACCTC 


2606 


TGAGCCTCAG 


CTATAGTCCA 


AGCATGCGTG 


AAGGAAGTGT 


TTGAGCCACT 


CATGCTCAGA 


2666 


GACCTGAGGC 


AGCCGGGTCT 


GCTCAGAGCT 


TGGACAGGGA 


CACCAGCGGC 


ACCTGCGGAA 


2726 


GGCCTGTGAG 


GGGCCACAGG 


AGACCAGCTC 


AGCACCTCAG 


AGCTGCTGCC 


TGCCCAGAAC 


2766 


TGGCGGGCGA 


GTTGCACCTT 


CAAGGAACAG 


GCACTTCCGG ' 


GAGGGGGGTG 


AGTTACTCCA 


2846 


CCTCCCAGTG 


CAGAAGACAA 


AGGTCTGGTC 


TCCACACCAC 


ACCCCAAGTC 


CCTGTCCTGT 


2906 


GCCACCAGTG 


AGAAGCTGCT 


GTCTGCTGCC 


CTAGCAGGTC 


CGTCACTACC 


GCAGGTGCTC 


2966 


CATCTTGAGA 


AAGGTCGACA 


GGCCTGAAGG 


ATCCACCTAA 


AGGACCTTTT 


TCTCCTGTTT 


3026 


TGTTTGCTTT 


GTTTATACTT 


GTTTAAAGTA 


TGTGGTGGGA 


AGGGAGTGTC 


GAGGAGACAA 


3086 


GGTGTTAATG 


CTTTGTCTTC 


TGCACATAAG 


CACTCCTCCA 


GCACTGCAGA 


AGTGTGTCCC 


3146 


TGTGGTGACT 


CTAAGCCACA 


GCTGTCCCTC 


AGAGGGCGCG 


ATCTTTATGG 


AAGGAGCCTC 


3206 


CTTTCACTTG 


CTGGGTTTTA 


ATAAGGCCAC 


TCTGACAGAG 


AGCAGAGGGA 


GCTGTGTACA 


3266 


ACCCAGCCAC 


AGAGCTCTCT 


GGGTGCACAG 


CTGAGCGGCT 


ATGGGAGACA 


GCCCGCTCCT 


3326 


TCCAGAGCAT 


GGGTGCACAG 


GCAGGGAGGT 


GTGCTACAGC 


AGCGGCCCTC 


CCAAGTCCTG 


3386 


CCCTGTGACT 


GCTGTAGGTG 


GGAAGCCAGG 


CATGTTGGGG 


GTTGTGGTGG 


GGAGGGGGGA 


3446 


TCAGAAGCCT 


AACATGGGGC 


TGGGAAGAAC 


TGGCCTGGTG 


CCCAGGCTGT 


GGAAAGGAAA 


3506 


ACTCCCAGCC 


CTCAGCATGC 


ACTGTAAGGT 


GTCTGTCTCA 


TCTGCATCGC 


TTGAGAGCTG 


3566 

tfr ttm V W 


GGAGCCCGCA 


TTCAGCAGTG 


TCCTTGGGTA GAGGAGTCTG ATTGGCATCC AGCCGCCCCA 


3626 


GTTCATGAGC 


TGCCCTCTGC 


AGCCTTGAGT 


CTTAGCCTAC 


CCAGCTCCAC 


AACCGTGTTT 


3686 


GAGTGGTGAA GCCCTTGTGT TGGAGGGGCT GAGCAGAAAT GGGGTGTACA TTGCACTGCC 


3746 


ACTGTGGTAT 


TAATAGCTCT 


GTGTTGTGTC 


CCCACCCCCA 


CCCCCCATCT 


TCTGCCTGCG 


3806 
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TGAGGTATCC ATACTTGTTT TGAGAGTGCT GTGGACTGGA AGCTGTGGGC TGTGACATTT 3866 
AATAAAGTGT GGATTTTGTT AGATAATTAT TTCTTGGACA ACAGGAACAA GTGTTCAGCT 3926 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 440 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

lie Asn Met Ala Val Arg Val Pro Leu Gly Cys Thr Gly Ser Phe Cys 
1 5 10 15 

Pro Arg Leu Leu Pro Leu Leu Ala Leu Leu Glu ( Leu Leu Val Asp Pro 

20 25 30 

Ser Leu Gly Arg Val His His Leu Ala Leu Lys Asp Asp Val Arg His 
35 40 < 45 

Lys Val His Leu Asn Thr Phe Gly Phe Phe Lys ,Asp Gly Tyr Met Val 
50 55 ,60 

Val Asn lie Ser Ser Leu Ser Val Asn Glu Pro Glu Gly Val Lys Asp 
65 70 75 80 

Lys Asp Thr Glu lie Gly Phe Ser Leu Asp Arg Thr Lys Asn Asp Gly 

85 90 95 

Phe Ser Ser Tyr Leu Asp Glu Asp Val Asn Tyr Cys lie Leu Lys Lys 

100 105 110 

Lys Ser Met Ser Ser Val Thr Leu Leu lie Leu Asp lie Ser Gly Ser 
115 120 125 

Gly Val Lys Val Arg Ser Pro Pro Glu Ala Gly Lys Gin Leu Pro Glu 
130 135 140 

lie Val Phe Ser Lys Asp Glu Lys Val Pro Ser Arg Ser Gin Glu Pro 
145 150 155 160 

Ala Val Ser Ser Asn Pro Lys Asp Ser Lys Val Gin Arg Thr Pro Asp 

165 170 175 

Gly Ser Lys Ala Gin Arg Ser Thr Val Asp Ser Lys Thr lie Ala Glu 

180 185 190 

Lys Phe Phe Ser lie His Lys Asn Asp Gly Ala Val Ser Phe Gin Phe 
195 200 205 

Phe Phe Asn lie Ser Thr Ser Asp Gin Glu Gly Leu Tyr Ser Leu Tyr 
210 215 220 

Phe His Lys Cys Pro Ser Ser Lys Leu Arg Ser Gly Glu Gin Val Ser 
225 230 235 240 



CTGAAAAAAA GTCGAC 



3942 
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Phe Ser Leu Asn lie Asp lie Thr Glu Lys Asn Pro Asp Ser Tyr Leu 

245 250 255 

Ser Ala Gly Glu lie Pro Leu Pro Lys Leu Tyr Val Ser Met Ala Leu 

260 265. 270 

Leu Phe Phe Leu Ser Gly Thr Val Trp He His He Leu Arg Lys Arg 
275 280 285 

Arg Asn Asp Val Phe Lys He His Trp Leu Met Ala Ala Leu Pro Phe 
290 295 300 

Thr Lys Ser Leu Ser Leu Val Phe His Ala He Asp Tyr His Tyr He 
305 310 315 320 

Ser Ser Gin Gly Phe Pro He Glu Gly Trp Ala Val Val Tyr Tyr He 

325 330 335 

Thr His Leu Leu Lys Gly Ala Leu Leu Phe He Thr He Ala Leu He 

340 345 350 

Gly Thr Gly Trp Ala Phe He Lys His He Leu Ser Asp Lys Asp Lys 
355 360 365 

Lys He Phe Met He Val He Pro Leu Gin Val Leu Ala Asn Val Ala 
370 375 ' 380 

Tyr He He He Glu Ser Thr Glu Glu Gly Thr Thr Glu Tyr Gly Leu 
385 390 395 400 

Trp Lys Asp Ser Leu Phe Leu Val Asp Leu Leu Cys Cys Gly Ala He 

405 410 415 

Leu Phe Pro Val Val Trp Ser He Arg His Leu Gin Glu Ala Ser Ala 

420 425 430 

Thr Asp Gly Lys Leu Pro Leu Thr 
435 440 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1838 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE : HW051 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 114.. 1505 

(ix) FEATURE: 

(A) NAME /KEY: mis cofeature 

(B) LOCATION: 1411.. 1603 

(D) OTHER INFORMATION: /label* SAC.24170 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

GTCGACGGCT GGTTTGAAAA GTGACAACGG CCGGTGGATT TTAGGAGTTT GCTCGGTTTG 60 

TAACTGCTCT TTGGTGAGCT ACTGGGACTG CAGACTAGGA GGAGACTCCC AAA ATG 116 

Met 
1 

GAA ACT CTG TCC TTC CCC AGA TAC AAC ATA GCT GAG ATT GTA GTT CAT 164 
Glu Thr Leu Ser Phe Pro Arg Tyr Asn lie Ala Glu lie Val Val His 

5 10 15 

ATT CGC AAT AAA CTG TTA ACT GGA GCG GAT GGC AAA AAC CTC TCC AAG 212 
lie Arg Asn Lys Leu Leu Thr Gly Ala Asp Gly Lys Asn Leu Ser Lys 
20 ( 25 30 

AGC GAT TTT CTT CCA AAC CCG AAG CCT GAA GTC CTG TAC ATG ATT TAC 260 
Ser Asp Phe Leu Pro Asn Pro Lys Pro Glu Val Leu Tyr Met lie Tyr 
35 40 45 

ATG AGA GCC TTA CAG TTA GTG TAT GGG GTC CGG CTG GAG CAT TTC TAC 308 
Met Arg Ala Leu Gin Leu Val Tyr Gly Val Arg Leu Glu His Phe Tyr 

50 55 60 65 

1 t 

ATG ATG CCG GTG AAC ATA GAA GTC ATG TAT CCA CAT ATA ATG GAG GGC 356 
Met Met Pro Val Asn lie Glu Val Met Tyr Pro His lie Met Glu Gly 

70 75 80 

r 

TTC TTA CCG GTC AGC AAT TTG TTC TTC CAC CTG GAC TCG TTT ATG CCC 404 
Phe Leu Pro Val Ser Asn Leu Phe Phe His Leu Asp Ser Phe Met Pro 

85 90 95 

ATT TGC CGG GTG AAT GAC TTT GAG ATC GCC GAT ATT CTT TAT CCA AAA 452 
He Cys Arg Val Asn Asp Phe Glu He Ala Asp He Leu Tyr Pro Lys 
100 105 110 

GCA AAC CGG ACA AGT CGT TTT TTA AGT GGC ATT ATC AAC TTT ATT CAC 500 

Ala Asn Arg Thr Ser Arg Phe Leu Ser Gly He lie Asn Phe He His 
115 120 125 

TTC AGA GAA ACA TGC CTG GAG AAG TAT GAA GAA TTT CTT TTG CAA AAT 548 
Phe Arg Glu Thr Cys Leu Glu Lys Tyr Glu Glu Phe Leu Leu Gin Asn 
130 135 140 145 

AAA TCC TCT GTG GAC AAA ATA CAG CAG TTA AGC AAT GCA CAC CAG GAA 596 
Lys Ser Ser Val Asp Lys He Gin Gin Leu Ser Asn Ala His Gin Glu 

150 155 160 

GCA TTG ATG AAA CTG GAA AAA CTC AAT TCG GTT CCC GTG GAG GAG CAG 644 
Ala Leu Met Lys Leu Glu Lys Leu Asn Ser Val Pro Val Glu Glu Gin 

165 170 175 

GAA GAG TTC AAA CAG CTG AAG GAT GAC ATC CAG GAG CTG CAG CAC TTG 692 
Glu Glu Phe Lys Gin Leu Lys Asp Asp He Gin Glu Leu Gin His Leu 
180 185 190 

CTG AAT CAA GAC TTC AGA CAG AAA ACG ACA CTG CTG CAG GAG AGA TAT 740 
Leu Asn Gin Asp Phe Arg Gin Lys Thr Thr Leu Leu Gin Glu Arg Tyr 
195 200 205 

ACC AAA ATG AAA TCA GAT TTT TCA GAG AAA ACC AAG CAT GTT AAT GAG 788 
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Thr Lys Met Lys Ser Asp Phe Ser Glu Lys Thr Lys His Val Asn Glu 
210 215 220 225 



CTA AAG TTG TCA GTA GTT TCT TTG AAA GAA GTT CAA GAC AGT TTG AAA 
Leu Lys Leu Ser Val Val Ser Leu Lys Glu Val Gin Asp Ser Leu Lys 

230 235 240 



836 



AGC AAA ATT GTG GAT TCT CCA GAG AAG CTG AAG AAC TAT AAA GAG AAG 
Ser Lys He Val Asp Ser Pro Glu Lys Leu Lys Asn Tyr Lys Glu Lys 

245 250 255 



884 



ATG AAG GAC ACC GTC CAG AAG CTC CGC AGT GCC AGG GAA GAA GTG ATG 
Met Lys Asp Thr Val Gin Lys Leu Arg Ser Ala Arg Glu Glu Val Met 
260 265 270 



932 



GAG AAG TAT GAT ATC TAT AGA GAT TCT GTG GAT TGC TTG CCT TCC TGT 
Glu Lys Tyr Asp He Tyr Arg Asp Ser Val Asp Cys Leu Pro Ser Cys 
275 280 285' 



980 



CAG CTG GAG GTG CAG TTA TAT CAA AAG AAA TCA CAG GAC CTT GCA GAT 
Gin Leu Glu Val Gin Leu Tyr Gin Lys Lys Ser Gin Asp Leu Ala Asp 
290 295 300 , 305 



1028 



AAT AGG GAG AAA CTA AGC AGT ATC TTA AAG GAG AGC CTG AAC CTG GAG 
Asn Arg Glu Lys Leu Ser Ser He Leu Lys Glu Ser Leu Asn Leu Glu 

310 315 . 320 



1076 



GGC CAG ATT GAT AGT GAT TCA TCA GAA CTA AAG AAA CTG AAG ACT GAA 
Gly Gin He Asp Ser Asp Ser Ser Glu Leu Lys Lys Leu Lys Thr Glu 

325 330 1 335 



1124 



GAG AAC TCC CTC ATA AGA CTG ATG ACT CTA AAG AAG GAG AGA CTT GCT 
Glu Asn Ser Leu He Arg Leu Met Thr Leu Lys Lys Glu Arg Leu Ala 
340 345 350 



1172 



ACC ATG CAG TTC AAA ATA AAC AAG AAG CAG GAG GAT GTG AAA CAG TAC 
Thr Met Gin Phe Lys He Asn Lys Lys Gin Glu Asp Val Lys Gin Tyr 
355 360 365 



1220 



AAG CGG ACC ATG ATT GAA GAT TGC AAT AAA GTT CAA GAA AAA AGA GAT 
Lys Arg Thr Met He Glu Asp Cys Asn Lys Val Gin Glu Lys Arg Asp 
370 375 380 385 



1268 



GCT GTC TGC GAG CAA GTA ACC GCC ATT AAT CAA GAC ATC CAC AAG ATT 
Ala Val Cys Glu Gin Val Thr Ala He Asn Gin Asp He His Lys lie 

390 395 400 

AAA TCT GGG ATT CAG CAG CTA AGA GAC GCC GAA AAA CGG GAG AAA CTG 
Lys Ser Gly He Gin Gin Leu Arg Asp Ala Glu Lys Arg Glu Lys Leu 

405 410 415 



1316 



1364 



AAG TCT CAG GAA ATC TTG GTA GAC TTG AAA AGT GCT TTG GAG AAG TAC 
Lys Ser Gin Glu He Leu Val Asp Leu Lys Ser Ala Leu Glu Lys Tyr 
420 425 430 



1412 



CAT GAG GGC ATC GAG AAG ACG ACG GAG GAG TGC TGC ACT AGA ATA GGA 
His Glu Gly He Glu Lys Thr Thr Glu Glu Cys Cys Thr Arg He Gly 
435 440 445 



1460 



GGG AAG ACT GCC GAG CTG AAG AGG AGG ATG TTC AAA ATG CCG CCC 
Gly Lys Thr Ala Glu Leu Lys Arg Arg Met Phe Lys Met Pro Pro 
450 455 460 



1505 
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TGATCAACAG CCACCCGAAA ATGGCCTTTC GCTTTCTGTT TGGAGTAGTT ATATTGAAGC 1565 

TAATAGAAGG ACCGCAGTCT CAGCTAACTA GCGTGGGTAC CATTGGTTCT CTGTCCTTTT 1625 

ATGACCATGT GCTCCCTGTG TTTTTTTCTT GGTGATGGGG ATACAACTCA GGGCCTTGCA 1685 

GGCTACACTG ACTTGCCTCC CTAGGCTCTA ATGTACCATG TACTATGTAG GCTTTTGCTA 1745 

CAATTAAAGT AACGTGTACA GCTTTTATGT CCCTACTCTG TCTCCTTTTG TATGTGCTGG 1805 

TTGGAATAAA CAAATAGTTA CTGACGTCAA AAA 1838 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 464 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Met Glu Thr Leu Ser Phe Pro Arg Tyr Asn lie Ala Glu lie Val Val 
1 5 10 < 15 

His He Arg Asn Lys Leu Leu Thr Gly Ala Asp Gly Lys Asn Leu Ser 

20 25 30 

Lys Ser Asp Phe Leu Pro Asn Pro Lys Pro Glu Val Leu Tyr Met He 
35 40 45 

Tyr Met Arg Ala Leu Gin Leu Val Tyr Gly Val Arg Leu Glu His Phe 
50 55 60 

Tyr Met Met Pro Val Asn He Glu Val Met Tyr Pro His He Met Glu 
65 70 75 80 

Gly Phe Leu Pro Val Ser Asn Leu Phe Phe His Leu Asp Ser Phe Met 

85 90 95 

Pro He Cys Arg Val Asn Asp Phe Glu He Ala Asp He Leu Tyr Pro 

100 105 110 

Lys Ala Asn Arg Thr Ser Arg Phe Leu Ser Gly He He Asn Phe He 
115 120 125 

His Phe Arg Glu Thr Cys Leu Glu Lys Tyr Glu Glu Phe Leu Leu Gin 
130 135 140 

Asn Lys Ser Ser Val Asp Lys He Gin Gin Leu Ser Asn Ala His Gin 
145 150 155 160 

Glu Ala Leu Met Lys Leu Glu Lys Leu Asn Ser Val Pro Val Glu Glu 

165 170 175 

Gin Glu Glu Phe Lys Gin Leu Lys Asp Asp He Gin Glu Leu Gin His 

180 185 190 

Leu Leu Asn Gin Asp Phe Arg Gin Lys Thr Thr Leu Leu Gin Glu Arg 
195 200 205 
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Tyr Thr Lys Met Lys Ser Asp Phe Ser Glu Lys Thr Lys His Val Asn 
210 215 220 

Glu Leu Lys Leu Ser Val Val Ser Leu Lys Glu Val Gin Asp Ser Leu 
225 230 235 240 

Lys Ser Lys He Val Asp Ser Pro Glu Lys Leu Lys Asn Tyr Lys Glu 

245 250 255 

Lys Met Lys Asp Thr Val Gin Lys Leu Arg Ser Ala Arg Glu Glu Val 

260 265 270 

Met Glu Lys Tyr Asp He Tyr Arg Asp Ser Val Asp Cys Leu Pro Ser 
275 280 285 

Cys Gin Leu Glu Val Gin Leu Tyr Gin Lys Lys Ser Gin Asp Leu Ala 
290 295 300 

Asp Asn Arg Glu Lys Leu Ser Ser He Leu Lys Glu Ser Leu Asn Leu 
305 310 315 320 

Glu Gly Gin He Asp Ser Asp Ser Ser Glu Leu Lys Lys Leu Lys Thr 

325 330 335 

Glu Glu Asn Ser Leu He Arg Leu Met Thr Leu Lys Lys Glu Arg Leu 

340 ' 345 * 350 

Ala Thr Met Gin Phe Lys He Asn Lys Lys Gin Glu Asp Val Lys Gin 
355 360 365 

Tyr Lys Arg Thr Met He Glu Asp Cys Asn Lys Val Gin Glu Lys Arg 
370 375 380 

Asp Ala Val Cys Glu Gin Val Thr Ala He Asn Gin Asp He His Lys 
385 390 395 400 

He Lys Ser Gly He Gin Gin Leu Arg Asp Ala Glu Lys Arg Glu Lys 

405 410 415 

Leu Lys Ser Gin Glu lie Leu Val Asp Leu Lys Ser Ala Leu Glu Lys 

420 425 430 

Tyr His Glu Gly He Glu Lys Thr Thr Glu Glu Cys Cys Thr Arg He 
435 440 445 

Gly Gly Lys Thr Ala Glu Leu Lys Arg Arg Met Phe Lys Met Pro Pro 
450 455 460 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
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<B) CLONE: HW052 

(ix) FEATURE: 

(A) NAME /KEY; CDS 

<B) LOCATION: 247.. 76 5 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION : complement (660.. 1062) 

(D) OTHER INFORMATION: /label* SAC_24882 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

GTCGACCAAA AACAGGACAG ATTGCAACAG CCTCTACACT JTCCTTCATGG CCTGCTCTTG 60 

GGCCGTGGCG ACAAGCAGAA AGAAGCTTGA TTGTGGAGAG GTGCCCTAGC GGAGGGATGT 120 

CTGCCACAGT CCAGGCACCC CACAGAGGGT AGGAGCTCCA TGTACGAGTG GCTTTGCATT 180 

TCTTCAGAAA AGAATAGCTG ATGTGCAATC TCAGCCTTTC TGCAGAAGGA TTTATCAGGG 240 

i 

GCCAGC ATG GGC TCT GTG AAT TCT AGA GGC CAT AAG GCC CAG GTG GTA 288 
Met Gly Ser Val Asn Ser Arg Gly His Lys Ala Gin Val Val 
15 10 

ATG CTG GGC CTC GAC TGT GCT GGC AAG ACC ACA ATC CTG TAG AAA CTG 336 
Met Leu Gly Leu Asp Cys Ala Gly Lys Thr Thr IJ.e Leu Tyr Lys Leu 
15 20 25. 30 

AAA GGA AAC CGG CTG GTG GAT ACC CTA CCC ACT GTT GGT TTT AAT GTA 384 
Lys Gly Asn Arg Leu Val Asp Thr Leu Pro Thr Val Gly Phe Asn Val 

35 40 45 

GAG CCT CTT GAG GCT CCT GGA CAT GTG TCG CTG ACT CTC TGG GAC ATT 432 
Glu Pro Leu Glu Ala Pro Gly His Val Ser Leu Thr Leu Trp Asp lie 

50 55 60 

GGG GGA CAG ACC CAG CTC AGG GCT ACG TGG AAG GAC TAC CTG GAA GGC 480 
Gly Gly Gin Thr Gin Leu Arg Ala Thr Trp Lys Asp Tyr Leu Glu Gly 
65 70 75 

ATT GAC CTC CTT GTG TAC GTG CTG GAC AGC ACA GAT GAA GCC CGC TTG 528 
lie Asp Leu Leu Val Tyr Val Leu Asp Ser Thr Asp Glu Ala Arg Leu 
80 85 90 

CCC GAG GCA GTG GCT GAG CTC GAG GAA GTC CTA GAA GAC CCC AAC ATG 576 
Pro Glu Ala Val Ala Glu Leu Glu Glu Val Leu Glu Asp Pro Asn Met 
95 100 105 110 

GCT GGT GTT CCT TTC TTG GTA CTG GCC AAC AAG CAG GAG GCT CCT GAT 624 
Ala Gly Val Pro Phe Leu Val Leu Ala Asn Lys Gin Glu Ala Pro Asp 

115 120 125 

GCT CTA CCA TTG CTT GAA ATC AGA AAC AGG CTG GAC CTG GAA AGG TTC 672 
Ala Leu Pro Leu Leu Glu lie Arg Asn Arg Leu Asp Leu Glu Arg Phe 

130 135 140 

CAA GAC CAT TGC TGG GAG CTG CGG GCT TGC AGC GCT CTC ACA GGT CAG 720 
Gin Asp His Cys Trp Glu Leu Arg Ala Cys Ser Ala Leu Thr Gly Gin 
145 150 155 

GGG CTA CAG GAA GCC CGG CAG AGC CTG CTG CAT TTG CTG AGA TCC 765 
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Gly Leu Gin Glu Ala Arg Gin Ser Leu Leu His Leu Leu Arg Ser 



160 




165 




170 






TGATGAAATG 


CAGTAACCTC 


TAACTCCAGG 


CTGGAGCAGG 


ACAGGGAGCA 


GGGTCAGCCA 


825 


GACCCCCTGA 


GCAGGCAAAT 


AGATTATCCT 


TGCTTCTAAG 


AACAGGAGGG 


GCCAGCTGAT 


885 


CCTTGAGAAA 


GTGAAGCTCC 


GTTTGCCACA 


TGAGAAACCA 


CGTTTCTCTT 


GGACGGTGTG 


945 


TTTTCTTCTC 


AGGACTTCTT 


CTCAGTGGCG 


GTTATGCGGA AAATCAACTG 


TTCTTCAAGT 


1005 


AACAATCGTC 


AGAGGGAATA 


TAAAAACCCT 


CAGAATACCC 


CCATGTAACA 


GCTCAGTACC 


1065 


AAGTGTCAAA 


GTCACAAACT 


TGGTGACATA 


ATTATTGATG 


ACCAAGTTCA 


CAAATGAAAA 


1125 


GAGGTTTCAA 


AACTACATCT 


GAGTGTCACT 


GGGCCTATCG 


CTGTTATCCG 


AGTGTCACTG 


1185 


GGCCTATCAT 


TGTCATCCGA 


GTGTCACCAG 


GCCTATCACG 


GTCATCTGCC 


TGTCACTGGG 


1245 


CCTATCACAG 


TCATCAGGAG 


CGACTCTTCC 


TACTGGCCTT 


GGGGCAGAAG 


CCACTGTCAC 


1305 


TGGTGGGCTC 


ACAGCAGGAG 


GGGCAGGTCT 


GCTCTCATAA TGCTGCAGAA 


GGCAGAAGTC 


1365 


CATGGGGCCC 


TAGATCCTCC 


CAGAATTCCC 


CCTGATCTCC 


AACTCTGACT 


AACTCTTTTC 


1425 


TGCCCTATAT 


CCTCTGTTGT 


AAAAAAGTAA 


AATAAAACTC 


CTCTTATACT 


CTCAAAAA 


1483 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 173 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 

Met Gly Ser Val Asn Ser Arg Gly His Lys Ala Gin Val Val Met Leu 
15 10 15 

Gly Leu Asp Cys Ala Gly Lys Thr Thr lie Leu Tyr Lys Leu Lys Gly 

20 25 30 

Asn Arg Leu Val Asp Thr Leu Pro Thr Val Gly Phe Asn Val Glu Pro 
35 40 45 

Leu Glu Ala Pro Gly His Val Ser Leu Thr Leu Trp Asp He Gly Gly 
50 55 60 

Gin Thr Gin Leu Arg Ala Thr Trp Lys Asp Tyr Leu Glu Gly He Asp 
65 70 75 80 

Leu Leu Val Tyr Val Leu Asp Ser Thr Asp Glu Ala Arg Leu Pro Glu 

85 90 95 

Ala Val Ala Glu Leu Glu Glu Val Leu Glu Asp Pro Asn Met Ala Gly 

100 105 110 

Val Pro Phe Leu Val Leu Ala Asn Lys Gin Glu Ala Pro Asp Ala Leu 
115 120 125 
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Pro Leu Leu Glu lie Arg Asn Arg Leu Asp Leu Glu Arg Phe Gin Asp 
130 135 140 

His Cys Trp Glu Leu Arg Ala Cys Ser Ala Leu Thr Gly Gin Gly Leu 
145 150 155 160 

Gin Glu Ala Arg Gin Ser Leu Leu His Leu Leu Arg Ser 

165 170 

(2) INFORMATION FOR SBQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2288 base pairs 

(B) TYPE: nucleic acid 5 

(C) STRANDEDNESS : ' double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: ' HW054 

(ix) FEATURE: 1 

(A) NAME /KEY: misc_f eature 

(B) LOCATION: 1952.. 2152 

(D) OTHER INFORMATION: /label- SAC_24183 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 



GTCGACGGTG 


GGGTCTAACT 


GAGGTCTTGT 


ATTTGGTTTT 


TCCTGAAGAT 


CGACCTCTGG 


60 


CATCAGGAGG 


CTTGAGCAAG 


GAGAAGAGTG 


GATGATGAAG 


GAAAGAGACA 


AGGAAAGAAA 


120 


GAGAAGTGTG 


CACAAAGAAA 


CTTGTATTAT 


TATTAATTAG 


CACCTAGCTT 


GTTTGTGTCT 


180 


GATACACCAC 


CAAGTAGTAA 


TTGTTGAAAA 


AAGCGAAGAA 


GAAAAAAAAA 


AAAACAAAAA 


240 


AACCAAACAG 


TGGGTACTCA 


AATAAGATAG 


GAGAAAAATG 


AAGAGAACAG 


ACCCAGTTCT 


300 


CGACCCTTGC 


TTCTCGAAGG 


TCCTCCCACC 


AGGCTGCCAA 


AGCAAGATGG 


TGTTGCTCTG 


360 


ATCCAGTCAG 


TATTCTTTTG 


ACTTTTTTTT 


TTAATCTCCA 


GGTTTTGGTT 


CAGGCTCCCA 


420 


TATTCATACC 


CTGGCTCATT 


TAGCTTTCCC 


TCATGTTGTG 


GGTTCTTCTG 


TCCCTCACCC 


460 


CCTTACTCTC 


CCCACTGATA 


TTCTTCCCAC 


GTCAAGACTG 


TGGCTCTGGA 


AGAAATATCC 


540 


ACCATTTGCA 


GAGCTGATGT 


TCTGTAGATC 


GTAATGTTGA AGCGCTGGGT 


GTCCTGGTTG 


600 


GCAGAATCAC 


TCCTGTATTA 


CTCTGGTACA 


TAGGTGTCTC 


CTGNTAGACT 


CCCTGGCCTT 


660 


AGTCATGGGG 


TGTTTTCTAG 


AGGCAGACTA 


AGACAGGAGT 


CAAAAAAGAT 


TTAGAGGAAG 


720 


GAGCTGAGGA 


AAGAAAGACA 


GTTGTGGGAG 


GAAAATCAAG 


TTCTACTCAG 


GATCCCGAGT 


780 


GTTTCTGTAG 


ATGTAGATTG 


GAATGTGTCC 


ATAACAGAGA 


GGCCAGTGAG 


AGACATCCCC 


840 
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AAGGACCTGC 


CAGGCTTTCC 


TTCGCTCCAG 


GAAGACGCAC 


CATCACTCAA 


AAGGGGTTTC 


900 


CTAGAAAGAA 


AGACAAGTGA 


CTTAAAAAAT 


CTGCCAGTGG 


GTTCTTGAAG 


TCATCGAACC 


960 


TAGGAGGTAG 


AAAGCTCACG 


GGGGTCATGT 


TATCGAAGGG 


AGGTGAGGTT 


CAGTCAGGCT 


1020 


AACTATTAGG 


GAGANTGTCA 


AAGGAGGCTA 


CAGAAATGGG 


ATTCTGGCTG 


TGGGGCATTA 


1080 


AAGGCTATCA 


AGTCAACAGG 


TAAGAGTGGA 


GTGGCTCAGC 


AGCTTCTCTG 


GCGCTGAAAG 


1140 


TGGTGTCTCT 


TAACAGTTAA 


TTCCCTTCCC 


TTTTCCATCA 


CAGAAGGAAT 


CATTCCAGCC 


1200 


CTCCCTGGGA 


TTATGGTCCC 


CCAGTGACCA 


TGTTCCATCT 


GCTGGATTTT 


TGATCAGTTT 


1260 


AGAAGGAAAA 


AAACACTTGG 


GTTCGGTGGT 


ATTTGCTTAT 


GACCTTAGCA 


CTTTGGAGCT 


1320 


GAGAAGAGGA 


TTGCCATAGT 


TTCCAGACCA 


GTCTGGTTTA CATAGTGAGT 


TCTAGGGCTG 


1380 


CGTAATGAGA 


TGTCTCAAGA 


ATGGAGAGGG 


GCGTTAGGGA 


GAACTTACTT 


AGCTCTTCAA 


1440 


GAAACTGTCC 


TGTAAACAAA 


GAGAAATGAG 


TCGATGGGTG 


ACCAGTGGGA 


GCCTNCCCCC 


1500 


CACCTCCTCA 


GACAACCCTT 


CTTTTCCATT AAGGGGCCTG TGCATCTCAG 


TAAGTGGTCC 


1560 


TCAGTTTTCC 


TGAGGATTCT 


GATGGTGAGA 


GTAGAGAGAG 


CAAAAAGATT 


CTCAGCCCTG 


1620 


GGATCTAGAA 


CCTTGGTTCC 


ACTCTTTTCC 


CGTTCTGGTC 


r 

TGCCTGAGCT 


CCCCCCCCCC 


1680 


GCCCCCCAGT 


GGTCCTGTTT 


GCTGTCCAGA 


CATTGCACCT 


GCCAGTCTTC 


GGTGCAGCCA 


1740 


GTTGTAGCAG 


ATGATCAAAG 


CAACCAGNGG 


TCCAAGAACA 


GGGACAATCA 


ACAAACAGTG 


1800 


TTATCTTCCA 


GCAGGGCACT 


CTCAGGAAGT 


GAGGATCTGA ATGGTAAAGC 


AGAACAGAGT 


1860 


GTTTCCCTCT 


ATGCCACAGC 


CGCTCATGCC 


ACACCCACTT 


GAAGCCTCCA 


AAGCCAGNAC 


1920 


AAGCTGTAGA 


GTCCATGAAT 


GTAGATGAGG 


TACATTGTGC 


GCCTTGCTTT 


TACTTCCTTT 


1980 


AGCTTCTGAT 


TGAATTTAGC 


TGTATCTTTT 


TAACCGATTT 


ATCGTAGAAN 


ACAATGGGTG 


2040 


GTGTGACCCT 


NTCATGCTTT 


GTTGATCTTC 


AGCCACACGC 


TGCCCTCCCT 


CGCCCCCACT 


2100 


TCTAGCATTT 


GTGTGTGTGC 


GCGTGTTTTT 


TTGAAGAGGG 


GTTNTTGCTT 


GTACAATCCC 


2160 


TGGCTGGCCT 


TGAACTTGAG 


AAGTCCTCCT 


GCCTCCGCCT 


TCCAGGTGCT 


GGGATTACAA 


2220 


GTTGTCAACC 


CTGCTTTCAC 


TAATGTTTCC 


TTTTTATTAA AGATTTATTT 


ATTATTTAAA 


2280 


AAGTCGAC 












2288 


<2) INFORMATION FOR SEQ ID NO: 49 


* 
* 









(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2349 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
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(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW055 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 214.. 1329 

(ix) FEATURE: 

(A) NAME/ KEY: misc^feature 

(B) LOCATION: 1953.. 2065 

(D) OTHER INFORMATION: /label* SAC_24449 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49; 
GTCGACAATG CTTTCAAGCA AACAGCCGCT ACCAGAACAA CTCTGTGACT AGGAGAAAGA 60 

! 

GGCCAAGGCG CGAGTTCCCA AACCCTGAAG CCACAGGAGC CCAAGGGGGA GACTGGAGAG 120 

AAGGTATTTC TAGCCGTGCA CAGTGACTGA TCAGCCGAGG ACCTACCCTC GAGTTCTATT 180 

CTGCAACAGT GAACAGTGAA GGTCTCCAAA GAG ATG GAG i TAC GAG GGT TAC AAC 234 

Met Glu Tyr Glu Gly Tyr Asn 
1 5 

GAC TCC AGC ATC TAC GGT GAG GAG TAT TCT GAC GGC TCG GAC TAC ATC 282 
Asp Ser Ser He Tyr Gly Glu Glu Tyr Ser Asp Gly Ser Asp Tyr He 
10 15 20 

GTG GAC TTG GAG GAG GCG GGT CCA CTG GAG GCC AAG GTG GCC GAG GTC 330 
Val Asp Leu Glu Glu Ala Gly Pro Leu Glu Ala Lys Val Ala Glu Val 
25 30 35 

TTC CTG GTG GTA ATC TAC AGC TTG GTG TGC TTC CTC GGG ATC CTA GGC .378 
Phe Leu Val Val He Tyr Ser Leu Val Cys Phe Leu Gly He Leu Gly 
40 45 50 55 

AAT GGC CTG GTG ATT GTC ATC GCC ACC TTC AAG ATG AAG AAG ACG GTG 426 
Asn Gly Leu Val He Val He Ala Thr Phe Lys Met Lys Lys Thr Val 

60 65 70 

AAC ACC GTG TGG TTT GTC AAC CTG GCC GTG GCT GAC TTC CTG TTC AAC 474 
Asn Thr Val Trp Phe Val Asn Leu Ala Val Ala Asp Phe Leu Phe Asn 

75 80 85 

ATC TTC TTG CCC ATC CAC ATC ACC TAT GCC GCT ATG GAC TAC CAC TGG 522 
He Phe Leu Pro He His He Thr Tyr Ala Ala Met Asp Tyr His Trp 
90 95 100 

GTG TTC GGG AAA GCC ATG TGC AAG ATT AGT AGC TTT CTG CTA AGC CAC 570 
Val Phe Gly Lys Ala Met Cys Lys He Ser Ser Phe Leu Leu Ser His 
105 110 115 

AAC ATG TAC ACC AGC GTC TTC CTG CTC ACT GTC ATC AGC TTC GAC CGC 618 
Asn Met Tyr Thr Ser Val Phe Leu Leu Thr Val He Ser Phe Asp Arg 
120 125 130 135 

TGC ATC TCC GTG CTC CTC CCC GTC TGG TCC CAG AAC CAC CGC AGC GTG 666 
Cys He Ser Val Leu Leu Pro Val Trp Ser Gin Asn His Arg Ser Val 

140 145 150 

CGT CTG GCC TAC ATG ACC TGC GTG GTT GTC TGG GTC CTG GCT TTC TTC 714 
Arg Leu Ala Tyr Met Thr Cys Val Val Val Trp Val Leu Ala Phe Phe 
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155 160 165 

CTG AGT TCC CCG TCC CTC GTC TTC CGG GAC ACG GTC AGC ACC AGC CAC 762 
Leu Ser Ser Pro Ser Leu Val Phe Arg Asp Thr Val Ser Thr Ser His 
170 175 180 

GGG AAG ATA ACC TGC TTC AAC AAC TTC AGC CTG GCC GCG CCC GAG CCT 810 
Gly Lys lie Thr Cys Phe Asn Asn Phe Ser Leu Ala Ala Pro Glu Pro 
185 190 195 

TTC TCT CAT TCC ACC CAC CCG CGA ACA GAC CCG GTA GGG TAC AGC AGA 858 
Phe Ser His Ser Thr His Pro Arg Thr Asp Pro Val Gly Tyr Ser Arg 
200 205 210 215 

CAT GTG GCG GTC ACC GTC ACC CGC TTC CTC TGT GGC TTC CTG ATC CCC 906 
His Val Ala Val Thr Val Thr Arg Phe Leu Cys Gly Phe Leu He Pro 

220 225 230 

GTC TTC ATC ATC ACG GCC TGT TAC CTC ACC ATC GTC TTC AAG TTG CAG 954 
Val Phe He He Thr Ala Cys Tyr Leu Thr He Val Phe Lys Leu Gin 

235 240 245 

CGC AAC CGC CTG GCC AAG ACC AAG AAG CCC TTC AAG ATC ATC ATC ACC 1002 
Arg Asn Arg Leu Ala Lys Thr Lys Lys Pro Phe Lys lie He He Thr 
250 255 ' 260 



ATC ATC ATC ACC TTC TTC CTC TGC TGG TGC CCC TAC CAC ACA CTC TAC 1050 
lie He lie Thr Phe Phe Leu Cys Trp Cys Pro Tyr His Thr Leu Tyr 
265 270 275 

CTG CTG GAG CTC CAC CAC ACG GCT GTG CCA GCC TCT GTC TTC AGC CTG 1098 
Leu Leu Glu Leu His His Thr Ala Val Pro Ala Ser Val Phe Ser Leu 
280 285 290 295 

GGA CTG CCC CTG GCC ACA GCC GTT GCC ATC GCC AAC AGC TGT ATG AAC 1146 
Gly Leu Pro Leu Ala Thr Ala Val Ala lie Ala Asn Ser Cys Met Asn 

300 305 310 

CCC ATC CTG TAC GTC TTC ATG GGC CAC GAC TTC AAA AAA TTC AAG GTG 1194 

Pro He Leu Tyr Val Phe Met Gly His Asp Phe Lys Lys Phe Lys Val 

315 320 325 

GCC CTT TTC TCC CGC CTG GTG AAT GCC CTG AGC GAG GAC ACA GGA CCC 1242 
Ala Leu Phe Ser Arg Leu Val Asn Ala Leu Ser Glu Asp Thr Gly Pro 
330 335 340 

TCC TCC TAC CCC AGT CAC AGG AGC TTC ACC AAG ATG TCC TCA TTG ATT 1290 
Ser Ser Tyr Pro Ser His Arg Ser Phe Thr Lys Met Ser Ser Leu He 
345 350 355 

GAG AAG GCT TCA GTG AAT GAG AAA GAG ACC AGC ACC CTC TGAGCCTTCC 1339 
Glu Lys Ala Ser Val Asn Glu Lys Glu Thr Ser Thr Leu 
360 365 370 



CTGGGACTGT 


CCCCAATGGG 


CTCCACAGCC 


CAGGGACTTG 


TCTCCTGAGG 


TAGGAGACAC 


1399 


GCCTGGAGCC 


TTTGGGTATG 


CCCCAGCGCC 


CGCTACGTCT 


TGCGCAAGGC 


GGCCCATTGT 


1459 


TTTGGGTGGA 


GTTCCCAAGT 


GTGGACACTC 


TTCCAGCAAA 


ATGGCAGACA 


AGCAACCTGA 


1519 


GCCCCTGCAG 


CAGGAGTATA 


GGACCAACTT 


CCACCGGCTC 


AGAAAAGGGA 


GGGTCTCTGA 


1579 


AACCAAGGTC 


TGAGCTGTGA 


CCCACATACA 


GGCACACAAT 


TTCACTGTGG 


ATGCCCCAAA 


1639 
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CATGCTGCAC 


AGTCTCCAGT 


GTGGGTGAGG 


ACGTCACTGC 


AGACCTGTTA 


TGGCGACATG 


1699 


ACAGTCAAAC 


CAGCAAATAC 


CCCACCACCA 


CTGTCATCCT 


CCAAGATCTT 


GACTTTGGAT 


1759 


TTCAGAAAAA 


TGGGGAGGAC 


CAGGACCTGA 


GGGGCTTCAT 


AGAACTTGCT 


TGCAAGGGCA 


1819 


GCCCAGGTGT 


GTGTGTGTGT 


ATGTGCTTGT 


GTGTGTGTGT 


GTGTGTATGT 


GCATGTGTGT 


1879 


GTGTGTGTGT 


GTGTGTGTGT 


GTGTGTGTTG 


AAGAGGTGGA 


GACAGAGAGC 


AGAGAACCCC 


1939 


AAAGGATGTG 


GGTACAGGAT 


ATCTCACACC 


ATCATGCCAC 


AGGGAGACAG 


AGCTACAACG 


1999 


ATGGGTGCTT 


GGGGGTTCAG 


ATGGTGGCAG 


CTCTTTGCTT 


CAGTGTGTAT 


ACCCCCCTAG 


2059 


CACTGTACCC 


CCAGGCTGGA 


GGGGGGCTCT 


TCACAGAAGC 


TTCCAACCCA 


TTCCAGCACT 


2119 


GGCTGGAAGG 


GCTTGQGATG 


TCCCTGGAGA 


CTGGTCTGGG 


ACTATCTTCA 


AAGCGTCTTC 


2179 


CAAAGGCTGC 


TGCTGACCAG 


CTCACCTGAC 


TCCTCCTTGA 


ACACACTCCC 


CACGTGGGGC 


2239 


ATCACAGCCC 


CGATGGCAGC 


CCAGGATCAG 


GCCTCCCTCC CTTGACAGGG AAATGACTCA 


2299 


CCCAACCCCA 


TGCACTGTCA 


GTAACAAAGA 


CCCAGGCCCA 


GCTCCAAAAA 




2349 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Met Glu Tyr Glu Gly Tyr Asn Asp Ser Ser lie Tyr Gly Glu Glu Tyr 
15 10 15 

Ser Asp Gly Ser Asp Tyr lie Val Asp Leu Glu Glu Ala Gly Pro Leu 

20 25 30 

Glu Ala Lys Val Ala Glu Val Phe Leu Val Val He Tyr Ser Leu Val 
35 40 45 

Cys Phe Leu Gly He Leu Gly Asn Gly Leu Val He Val He Ala Thr 
50 55 60 

Phe Lys Met Lys Lys Thr Val Asn Thr Val Trp Phe Val Asn Leu Ala 
65 70 75 80 

Val Ala Asp Phe Leu Phe Asn He Phe Leu Pro He His He Thr Tyr 

85 90 95 

Ala Ala Met Asp Tyr His Trp Val Phe Gly Lys Ala Met Cys Lys He 

100 105 110 

Ser Ser Phe Leu Leu Ser His Asn Met Tyr Thr Ser Val Phe Leu Leu 
115 120 125 

Thr Val He Ser Phe Asp Arg Cys He Ser Val Leu Leu Pro Val Trp 
130 135 140 
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Ser Gin Asn His Arg Ser Val Arg Leu Ala Tyr Met Thr Cys Val Val 
145 150 155 160 

Val Trp Val Leu Ala Phe Phe Leu Ser Ser Pro Ser Leu Val Phe Arg 

165 170 175 

Asp Thr Val Ser Thr Ser His Gly Lys lie Thr Cys Phe Asn Asn Phe 

180 185 190 

Ser Leu Ala Ala Pro Glu Pro Phe Ser His Ser Thr His Pro Arg Thr 
195 200 205 

Asp Pro Val Gly Tyr Ser Arg His Val Ala Val Thr Val Thr Arg Phe 

210 215 220 

i 

Leu Cys Gly Phe Leu lie Pro Val Phe lie lie Thr, Ala Cys Tyr Leu 
225 230 235 240 

Thr lie Val Phe Lys Leu Gin Arg Asn Arg Leu Ala Lys Thr Lys Lys 

245 250 255 

Pro Phe Lys He He He Thr He lie He Thr Phe Phe Leu Cys Trp 

260 265 270 

Cys Pro Tyr His Thr Leu Tyr Leu Leu Glu Leu His His Thr Ala Val 
275 280 285 

Pro Ala Ser Val Phe Ser Leu Gly Leu Pro Leu Ala Thr Ala Val Ala 
290 295 300 

t i 

He Ala Asn Ser Cys Met Asn Pro He Leu Tyr Val Phe Met Gly His 
305 310 315 320 

Asp Phe Lys Lys Phe Lys Val Ala Leu Phe Ser Arg Leu Val Asn Ala 

325 330 335 

Leu Ser Glu Asp Thr Gly Pro Ser Ser Tyr Pro Ser His Arg Ser Phe 

340 345 350 

Thr Lys Met Ser Ser Leu He Glu Lys Ala Ser Val Asn Glu Lys Glu 
355 360 365 

Thr Ser Thr Leu 
370 

<2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2577 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW056 

(ix) FEATURE: 
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(A) NAME /KEY: CDS 
{B) LOCATION: 8.. 523 

(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION: complement (738.. 953) 

(D) OTHER INFORMATION: /labels SAC_24326 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

GTCGACG GGC TAC CTT AGC CAG CTG ATG CTT CCA GTG ACA GAG CCA CTT 49 
Gly Tyr Leu Ser Gin Leu Met Leu Pro Val Thr Glu Pro Leu 
1 5 10 

TGT CCT TTG CAC AGC CTG ACA CCC TAT CAG GTC CCC TTC AAT GCA GTT 97 
Cys Pro Leu His Ser ( Leu Thr Pro Tyr Gin Val Pro Phe Asn Ala Val 
15 ,20 25 30 

GCT ATC CGG GTC ACG CAC GCC GAC GTT GCA CCT ACC CAC ATA CTG TAC 145 
Ala lie Arg Val Thr His Ala Asp Val Ala Pro Thr His He Leu Tyr 

35 40 1 45 

GCT ATG AAT GCC AGC TGG GTC GGC CTT TGC AAG ATT GTG GAT GAT ATG 193 
Ala Met Asn Ala Ser Trp Val Gly Leu Cys Lys He Val Asp Asp Met 

50 55 ' 60 

AAA GGC TAC ACT CGG' GGC CCC ATT CTG CTG GCC CAG AAC CCC ATA TGT 241 
Lys Gly Tyr Thr Arg Gly Pro He Leu Leu Ala Gin Asn Pro lie Cys 
65 70 75 

GAC TGT TTG GGC TTT GGT ATC TGC AGA GGC ATT GAC ATG GAC AAG CGG 289 
Asp Cys Leu Gly Phe Gly He Cys Arg Gly He Asp Met Asp Lys Arg 
80 85 90 

CTG TAC CAC ATC CTC ACC CCT CTA CCA CCA GAG GAG TTA AGA ACT GTG 337 
Leu Tyr His He Leu Thr Pro Leu Pro Pro Glu Glu Leu Arg Thr Val 
95 100 105 110 

AAC TGT CTG CTG GTT GGC ACC ATT TCC ATT CCA CAT TGT ATT TTC AAG 3 85 

Asn Cys Leu Leu Val Gly Thr He Ser lie Pro His Cys He Phe Lys 

115 120 125 

AAT CAG CCT GGG ACT GAA GGG ACA GTT CCT TAT GTC ACC AGG GAT TAT 433 
Asn Gin Pro Gly Thr Glu Gly Thr Val Pro Tyr Val Thr Arg Asp Tyr 

130 135 140 

AAT TTA AAA CTT CTT GGA GCA TCA GAG AAA ATT GGG GAG AGA GAG TAT 481 
Asn Leu Lys Leu Leu Gly Ala Ser Glu Lys He Gly Glu Arg Glu Tyr 
145 150 155 

AGA AAT ATA CTA CCT AGA CAC AAA TCA CGG CAA AGA AGA AAA 523 
Arg Asn He Leu Pro Arg His Lys Ser Arg Gin Arg Arg Lys 
160 165 170 

TGAAGCCTGG GTGTACAGAG AGACTTTCGC CTCGACACAT TGTCCCCAAG CCTAGAGTCA 583 

GCAGACTGAG CGGGCTGTGT GGCTGCTGAC CTCGTGCCCA TGCCCAGTGT TCATATAACT 643 

GGAAAATGTG CTGTGGTAAG AGTCATTCAA GCCCCTTCGT TTTTTTCCCC TCTAAGACAT 703 

GGAGGGCCTC ATATAGCTCA GACTGGCCTC AGTACTCTGG TCCTCCTGTC TCACTGTCCA 763 
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AGGGCTGCGC 


ATCCGGCTTT 


TGGTCTCATA 


CTTGGCCCAG 


ACCTTGCCGA 


CTGGTTGTCC 


823 


TGGGTGTGTG 


GTGCTCCTGG 


AGAGCTGGGA 


AGCATTGAAG 


AAAGCTGTGG 


TGTCCTCCAA 


883 


GAAGACCAGG 


GCCACAAAAG 


TGTGCCTAGA 


AACTGAGTAG 


CAGGCTGACC 


ACAGGTTCTC 


943 


GGGTGCAGTG 


TACGGACTGG 


CAGGGACCTG 


AGCAGGCTCA 


GTGGCCGTGA 


GTAAGAGGCT 


1003 


TACTCCGTTG 


AACCTTGTGG 


ACCTTAGCAC 


CCCAGTGTGG 


CGGCTCATTG 


GCCTGATCCT 


1063 


TTGATGCTTG 


CCATCGAATC 


TCCCAATGCT 


GCAATCAAGT 


GTTAGAGGAC 


GGTACGTCGT 


1123 


CTTGTTAATT 


TCTGGGTCTT 


TAATCATGAG 


GCTTTATGTT 


GAAGAACTTG 


CTCTGACATT 


1183 


CAGAAAGCCT 


CACGTTAGTT 


AAAGAAATCC 


ATGGTTCCCT 


AATTCTGGGG 


ATGTTCTGAC 


1243 


AGACCTTGAG 


GCTGTAGATA 


CGTCAGAAAG 


CCTCCATGTT AGTTAAGGAA ATCCATGGTT 


1303 


CCCTAATTCT 


GGGGATGTTC 


TGACTGGGAA 


TGTTCTGACA 


GACCTTGAGG 


CTGAGATACT 


1363 


ACCATGGGCC 


CACTAGCCTG 


GGACAGGCAC 


ACATATGCCT 


GGCTTGGGCA 


GACCTTTGCT 


1423 


ATCAACGATT 


TCTTCTAAGG 


CCCCAGAGGC 


CACTGCAAGT 


GGATGTTGCA 


ATCCGGAGTG 


1483 


ACTGGGTCGG 


GTCATTTTTG 


TGGTTGCAAA 


GCAGGACCCT 


GAGGAGCCTC 


TTGGGCTGGC 


1543 


ATGATGGATC 


TGACAAGACT 


CCACCTTGCA 


TGGAATCAGG 


AAGGACCAAG 


GCTATGGAAG 


1603 


GCTCACAACA 


TTGATTGTCT 


CCGTCTTGTC 


TGTAGTCATA G&AGTGTGGC CAGTCTCCTC 


1663 


TGCTTGGCGC 


CATCTTTGTG 


GTGGCCTCAG 


CATCTGAAAA 


TCCTCCGAGG 


AGGCAGAGCT 


1723 


GCCTCTTCTG 


GATTAGTCGG 


ATTCCCTGTA 


GATCACTCGG 


GCTCACAACT ACTCCCTGCC 


1783 


CCACCCCCAA 


CACCATCTCT 


GTCTCACTTG 


AGCTTGTGGT 


GAGCCTTTGC 


AGAGGGGAGG 


1843 


AGCTCTTCCT 


GCCCAGGCAC 


TCTCTGGGCT 


CCTGATCCTC 


AAGCCTCTGC 


TTCCTGGGTG 


1903 


CTCAGATCAC 


AGGTGTGTGC 


CGTCTTAGCC 


TTGCGGACAG 


TTTTAAGACA 


GCCTTGGTCA 


1963 


CCTTCATCTC 


CCTCCCCGGG 


TCTTCCTGTC 


TTCACAATAA 


AGATCCCGAG 


TCCAAAAAAA 


2023 


AAAAAAAAAA 


AAATTCCAGG 


AGGAAATCTG 


GGTGCACAGA 


ACTTATCTGA 


GTCGGCTTTG 


2083 


GTGAAGGAGA 


TCCTCAAGGA 


GCAGGAGAAC 


AGGAAGGGCC 


TCATAGCTGC 


CATCTGTGCG 


2143 


GGTCCTACGG 


CCCTGCTGGC 


TCACGAAGTA 


GGCTTTGGAT 


GCAAGGTTAC 


ATCGCACCCA 


2203 


TTGGCTAAGG 


ACAAAATGAT 


GAACGGCAGT 


CACTACAGCT 


ACTCAGAGAG 


CCGTGTGGAG 


2263 


AAGGACGGCC 


TCATCCTCAC 


CAGCCGTGGG 


CCTGGGACCA 


GCTTCGAGTT 


TGCGCTGGCC 


2323 


ATTGTGGAGG 


CACTCAGTGG 


CAAGGACATG 


GCTAACCAAG 


TGAAGGCCCC 


GCTTGTTCTC 




AAAGACTAGA 


GCCCAAGCCC 


TGGACCCTGG 


ACCCCCAGGC 


TGAGCAGGCA TTGGAAGCCC 


2443 


ACTAGTGTGT 


CCACAGCCCA 


GTGAACCTCA 


GGAACTAACG 


TGTGAAGTAG 


CCCGCTGCTC 


2503 


AGGAATCTCG 


CCCTGGCTCT 


GTACTATTCT 


GAGCCTTGCT 


AGTAGAATAA ACAGTTCCCC 


2563 


AAGCTCCTGA 


AAAA 










2577 
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(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 172 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Gly Tyr Leu Ser Gin Leu Met Leu Pro Val Thr Glu Pro Leu Cys Pro 
15 10 15 

Leu His Ser Leu Thr Pro Tyr Gin Val Pro Phe Asn Ala Val Ala lie 

20 25 ' 30 

Arg Val Thr His Ala Asp Val Ala Pro Thr His He Leu Tyr Ala Met 
35 40 45 

Asn Ala Ser Trp Val Gly Leu Cys Lys He Val Asp Asp Met Lys Gly 
50 55 |60 

Tyr Thr Arg Gly Pro He Leu Leu Ala Gin Asn Pro He Cys Asp Cys 
65 70 75 80 

i 

Leu Gly Phe Gly He Cys Arg Gly He Asp Met Asp Lys Arg Leu Tyr 

85 90 f 95 

His He Leu Thr Pro Leu Pro Pro Glu Glu Leu Arg Thr Val Asn Cys 

100 105 110 

Leu Leu Val Gly Thr He Ser He Pro His Cys He Phe Lys Asn Gin 
115 120 125 

Pro Gly Thr Glu Gly Thr Val Pro Tyr Val Thr Arg Asp Tyr Asn Leu 
130 135 140 

Lys Leu Leu Gly Ala Ser Glu Lys lie Gly Glu Arg Glu Tyr Arg Asn 

145 150 155 160 

He Leu Pro Arg His Lys Ser Arg Gin Arg Arg Lys 

165 170 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1339 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(8) CLONE: HW057 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 7., 1023 
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(ix) FEATURE: 

(A) NAME/KEY: misc.feature 

(B) LOCATION: 1032.. 1176 

(D) OTHER INFORMATION: /label* SAC.23926 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

GTCGAC GGC GCG CTG GAG GAC GCA CGG CTT GGC GGG AAG GCG AGG GGA 48 
Gly Ala Leu Glu Asp Ala Arg Leu Gly Gly Lys Ala Arg Gly 
1 5 10 

CCG TGC GGA GCC TCC GGT GCT GGC GGC GGC CAC CAG TTC GGG AGC CGG 96 
Pro Cys Gly Ala Ser Gly Ala Gly Gly Gly His Gin Phe Gly Ser Arg 
15 20 25 30 

GAG CCG GCC GCA GCA CGC GGA ACA GAG CAG AGG GTG GCG GGG CCC GGC 144 
Glu Pro Ala Ala Ala Arg Gly Thr Glu Gin Arg Val Ala Gly Pro Gly 

35 40 45 

GAA ACC ACG ATG AAG CCG AAC AGC GAG GAA GAG GAG GAG TTG GTG CAG 192 
Glu Thr Thr Met Lys Pro Asn Ser Glu Glu Glu Glu Glu Leu Val Gin 

50 55 60 

GGC GTG GGC CCC TGG GAC GAG TGC TTC GAG GTG GCC GTG CAG TTG GCG 240 
Gly Val Gly Pro Trp Asp Glu Cys Phe Glu Val Ala Val Gin Leu Ala 

65 ! 70 75 

TTG CGT GCG GGA CAA ATC ATC AGA AAG GCC CTC ACT GAG GAA AAA CAC 288 
Leu Arg Ala Gly Gin lie He Arg Lys Ala Leu Thr Glu Glu Lys His 
80 85 90 

GTC TCG ACG AAA ACA TCT GCT GCA GAT CTT GTG ACA GAA ACA GAT CAC 336 
Val Ser Thr Lys Thr Ser Ala Ala Asp Leu Val Thr Glu Thr Asp His 
95 100 105 110 

CGA GTA GAA GAC TTA ATT GTT TCT GAG TTG CGA AAG CGG TTC CCT TCA 384 
Arg Val Glu Asp Leu He Val Ser Glu Leu Arg Lys Arg Phe Pro Ser 

115 120 125 

CAC AGG TTC ATT GCA GAA GAG GCC ACA GCC TCC GGG GCC AAG TGT GTG 432 
His Arg Phe He Ala Glu Glu Ala Thr Ala Ser Gly Ala Lys Cys Val 

130 135 140 

CTC ACC CAC AGC CCG ACC TGG ATC ATC GAC CCC ATC GAC GGC ACC TGC 480 
Leu Thr His Ser Pro Thr Trp He He Asp Pro He Asp Gly Thr Cys 
145 150 155 

AAC TTT GTG CAC AGA TTC CCC ACT GTG GCA GTT AGC ATC GGA TTT GCT 528 
Asn Phe Val His Arg Phe Pro Thr Val Ala Val Ser He Gly Phe Ala 
160 165 170 

GTT CAC CAG GAG CTG GAA TTC GGA GTG ATT CAC CAC TGC ACA GAG GAG 576 
Val His Gin Glu Leu Glu Phe Gly Val He His His Cys Thr Glu Glu 
175 180 185 190 

CGG CTG TAC ACC GGC AGG AGG GGC CAG GGC GCC TTT TGC AAT GGC CAG 624 
Arg Leu Tyr Thr Gly Arg Arg Gly Gin Gly Ala Phe Cys Asn Gly Gin 

195 200 205 

AGG CTC CAG GTC TCC AGG GAG ACA GAT CTC GCA AAG GCC TTG GTT CTG 672 
Arg Leu Gin Val Ser Arg Glu Thr Asp Leu Ala Lys Ala Leu Val Leu 
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210 215 220 

ACA GAA ATC GGG CCC AAA CGT GAC CCC GAT ACT CTG AAA GTA TTC CTG 720 
Thr Glu lie Gly Pro Lys Arg Asp Pro Asp Thr Leu Lys Val Phe Leu 
225 230 235 

AGC AAC ATG GAG CGG CTG CTG CAC GCC AAG GCT CAT GGG GTC CGA GTG 768 
Ser Asn Met Glu Arg Leu Leu His Ala Lys Ala His Gly Val Arg Val 
240 245 250 

ATT GGC AGC TCC ACC TTG GCG CTC TGC TAC TTG GCC TCG GGG GCT GCT 816 
lie Gly Ser Ser Thr Leu Ala Leu Cys Tyr Leu Ala Ser Gly Ala Ala 
255 260 265 270 

GAT GCC TAT TAC CAG TTC GGC CTC CAC TGC TGG GAT CTG GCA GCT GCC 864 
Asp Ala Tyr Tyr Gin Phe Gly Leu His Cys Trp Asp Leu Ala Ala Ala 

275 280 285 

ACA GTC ATC ATC AGA GAA GCA GGT GGC ATT GTG ATT GAC ACC TCA GGT 912 
Thr Val He He Arg Glu Ala Gly Gly He Val He Asp Thr Ser Gly 

290 295 300 

GGA CCC CTT GAC CTC ATG TCG TGC AGA GTG GTG GCT GCT GGC ACC AGA 960 
Gly Pro Leu Asp Leu Met Ser Cys Arg Val Val Ala Ala Gly Thr Arg 

305 ( 310 ' 315 

' i 

GAG ATG GCA GTG CTp ATA GCT CAG GCC CTA CAA ACC ATT AAC TAC GGC 1008 1 

Glu Met Ala Val Leu He Ala Gin Ala Leu Gin Thr He Asn Tyr Gly 
320 325 330 

CGG GAC GAT GAG AAG TGAGCCGTAC AGAGCTCTAA GGCTGACATG AGCAGCTCCC 1063 

Arg Asp Asp Glu Lys 

335 

TGGGAAAGAG CTGTCCGGGG GCTTGAGTTC CGGGATAGTC TACCATAGCT GTCCCCGGAC 1123 

CTCGGTGCTT AGCTGATCCT CTCTAATCTC GGGTAGCCCC TTTCCAGGTC GGTACATGGT 1183 

CTTTCATCAG AGCCAAACCC AAATCTTGTG AGGTGTGTTA GTCACCCATC CTGGTTGTTC 1243 

GGAATGCAAA TCTCAGGTAA TAAAGCTTTA GAACGAGCTC TCAGGCCCTC CCCTGCCCGT 1303 

GGTGATACAG AATGCAATAA ATCAGAACTT TAAAAA 1339 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 339 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Gly Ala Leu Glu Asp Ala Arg Leu Gly Gly Lys Ala Arg Gly Pro Cys 
15 10 15 

Gly Ala Ser Gly Ala Gly Gly Gly His Gin Phe Gly Ser Arg Glu Pro 

20 25 30 

Ala Ala Ala Arg Gly Thr Glu Gin Arg Val Ala Gly Pro Gly Glu Thr 
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35 40 45 

Thr Met Lys Pro Asn Ser Glu Glu Glu Glu Glu Leu Val Gin Gly Val 
50 55 60 

Gly Pro Trp Asp Glu Cys Phe Glu Val Ala Val Gin Leu Ala Leu Arg 
65 70 75 80 

Ala Gly Gin He He Arg Lys Ala Leu Thr Glu Glu Lys His Val Ser 

85 90 95 

Thr Lys Thr Ser Ala Ala Asp Leu Val Thr Glu Thr Asp His Arg Val 

100 105 110 

Glu Asp Leu He Val Ser Glu Leu Arg Lys Arg Phe Pro Ser His Arg 
115 120 1 125 

Phe He Ala Glu Glu Ala Thr Ala Ser Gly Ala Lys Cys Val Leu Thr 
130 135 140 

His Ser Pro Thr Trp He He Asp Pro He Asp Gly Thr Cys Asn Phe 
145 150 155 ( 160 

Val His Arg Phe Pro Thr Val Ala Val Ser He Gly Phe Ala Val His 

165 170 175 

i 

Gin Glu Leu Glu Phe Gly Val He His His Cys Thr Glu Glu Arg Leu 

180 185 190 

Tyr Thr Gly Arg Arg Gly Gin Gly Ala Phe Cys Asn Gly Gin Arg Leu 
195 200 205 . 

Gin Val Ser Arg Glu Thr Asp Leu Ala Lys Ala Leu Val Leu Thr Glu 
210 215 220 

lie Gly Pro Lys Arg Asp Pro Asp Thr Leu Lys Val Phe Leu Ser Asn 
225 230 235 240 

Met Glu Arg Leu Leu His Ala Lys Ala His Gly Val Arg Val He Gly 

245 250 255 

Ser Ser Thr Leu Ala Leu Cys Tyr Leu Ala Ser Gly Ala Ala Asp Ala 

260 265 270 

Tyr Tyr Gin Phe Gly Leu His Cys Trp Asp Leu Ala Ala Ala Thr Val 
275 280 285 

He He Arg Glu Ala Gly Gly He Val He Asp Thr Ser Gly Gly Pro 
290 295 300 

Leu Asp Leu Met Ser Cys Arg Val Val Ala Ala Gly Thr Arg Glu Met 
305 310 315 320 

Ala Val Leu He Ala Gin Ala Leu Gin Thr He Asn Tyr Gly Arg Asp 

325 330 335 

Asp Glu Lys 



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2285 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW059 

(ix) FEATURE : 

(A) NAME /KEY: CDS 

(B) LOCATION: 9 ,.1271 , 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1991.. 2162 

(D) OTHER INFORMATION: /label= SAC_24457 

i 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

GTCGACAG CCC GCC ACT CCA GGC GCG ATG CTG TTC TGG ACT GTG CTC AGC 50 
Pro Ala Thr Pro Gly Ala Met Leu Phe Trp Thr Val Leu Ser 
15 10 

ATG GCT TTG AGT CTG CGG TTG GCA CTG GCG CAG AGC GGC ATA GAG CGC 98 
Met Ala Leu Ser Leu Arg Leu Ala Leu Ala Gin Ser Gly lie Glu Arg 
15 20 25 30 

GGT CCC ACA GCA TCA GCC CCC CAG GGG GAC CTG TTG TTC CTG CTG GAC 146 
Gly Pro Thr Ala Ser Ala Pro Gin Gly Asp Leu Leu Phe Leu Leu Asp 

35 40 45 

AGC TCA GCC AGT GTG TCA CAC TAT GAG TTC TCA AGA GTT CGA GAG TTT 194 
Ser Ser Ala Ser Val Ser His Tyr Glu Phe Ser Arg Val Arg Glu Phe 

50 55 60 

GTG GGG CAG CTG GTG GCT ACG ATG CCT TTC GGA CCC GGG GCT CTG CGT 242 
Val Gly Gin Leu Val Ala Thr Met Pro Phe Gly Pro Gly Ala Leu Arg 
65 70 75 

GCT AGT CTG GTG CAC GTG GGC AGC CGG CCT CAC ACG GAG TTT ACT TTT 290 
Ala Ser Leu Val His Val Gly Ser Arg Pro His Thr Glu Phe Thr Phe 
80 85 90 

GAT CAG TAC AGT TCA GGC CAG GCC ATA CAG GAT GCT GTA CGT GTT GCG 338 
Asp Gin Tyr Ser Ser Gly Gin Ala lie Gin Asp Ala Val Arg Val Ala 
95 100 105 110 

CCC CAA CGT ATG GGT GAC ACC AAC ACG GGC CTG GCA CTG GCT TAT GCC 386 
Pro Gin Arg Met Gly Asp Thr Asn Thr Gly Leu Ala Leu Ala Tyr Ala 

115 120 125 

AAA GAA CAA TTA TTT GCT GAG GAA GCA GGT GCC CGG CTA GGG GTT CCC 434 
Lys Glu Gin Leu Phe Ala Glu Glu Ala Gly Ala Arg Leu Gly Val Pro 

130 135 140 

AAG GTA CTG GTG TGG GTG ACA GAT GGG GCC TCC AGT GAC TCT GTG GGC 482 
Lys Val Leu Val Trp Val Thr Asp Gly Ala Ser Ser Asp Ser Val Gly 
145 150 155 
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CCC CCT ATG CAG GAG CTC AAG GAC CTG GGT GTC ACC ATC TTC ATT GTC 530 
Pro Pro Met Gin Glu Leu Lys Asp Leu Gly Val Thr lie Phe lie Val 
160 165 170 

AGC ACT GGC CGA GGC AAC CTG TTG GAG CTG TTG GCA GCT GCC TCA GCT 578 
Ser Thr Gly Arg Gly Asn Leu Leu Glu Leu Leu Ala Ala Ala Ser Ala 
175 180 185 190 

CCG GCT GAG AAG CAC CTA CAC TTT GTG GAT GTG GAT GAC CTT CCT ATC 626 
Pro Ala Glu Lys His Leu His Phe Val Asp Val Asp Asp Leu Pro lie 

195 200 205 

ATT GCC CGG GAG CTT CGG GGT GCC ATT ATT GAT GCG ATG CAG CCA CAT 674 
lie Ala Arg Glu Leu Arg Gly< Ala He He Asp Ala Met Gin Pro His 

210 215 220 

CAG CTT CAT GCT TCG' GAG ATT CTG TCC AAT GGC TTC CGC CTG TCC TGG 722 
Gin Leu His Ala Ser Glu He Leu Ser Asn Gly Phe Arg Leu Ser Trp 
225 230 235 

CCG CCC CTG CTG ACA GCG GAC TCT GGT TAC TAC GTG CTG GAG TTG GTG 770 
Pro Pro Leu Leu Thr Ala Asp Ser Gly Tyr Tyr Val Leu Glu Leu Val 
240 245 250 

CCC AGT GGC AAA CTG GCA GCC ACA AGA CGC CAA CAG CTG CCC GGG AAT 818 
Pro Ser Gly Lys Leu Ala Ala Thr Arg Arg Gin Glh Leu Pro Gly Asn 
255 1 260 265 270 

GCT ACC AGC TGG ACC TGG ACC GAC CTC AAC CCA GAC ACA GAT TAC GAA 866 
Ala Thr Ser Trp Thr Trp Thr Asp Leu Asn Pro Asp Thr Asp Tyr Glu 

275 280 285 

GTA TCG CTG TTG CCG GAG TCC AAT GTG CGC CTC CTG AGG CCC CAG CAC 914 
Val Ser Leu Leu Pro Glu Ser Asn Val Arg Leu Leu Arg Pro Gin His 

290 295 300 

TTG CGA GTA CGC ACA CTG CAA GAG GAG GCA GGG CCA GAA CGC ATC GTC 962 
Leu Arg Val Arg Thr Leu Gin Glu Glu Ala Gly Pro Glu Arg He Val 

305 310 315 

ATC TCG CAT ACT AGG CCG CGC AGC CTC CGT GTA AGT TGG GCC CCC GCA 1010 
He Ser His Thr Arg Pro Arg Ser Leu Arg Val Ser Trp Ala Pro Ala 
320 325 330 

CTT GGC CCG GAC TCC ACT CTC GGC TAC CTT GTA CAG CTC GGA CCT CTG 1058 
Leu Gly Pro Asp Ser Thr Leu Gly Tyr Leu Val Gin Leu Gly Pro Leu 
335 340 345 350 

CAG GGC GGA TCC CTA GAA CAC GTG GAG GTG CCA GCT GGC CAG AAC AGC 1106 
Gin Gly Gly Ser Leu Glu His Val Glu Val Pro Ala Gly Gin Asn Ser 

355 360 365 

ACT ACC ATC CAG GGC CTG ACG CCC TGC ACC ACT TAC CTG GTG ACT GTG 1154 
Thr Thr He Gin Gly Leu Thr Pro Cys Thr Thr Tyr Leu Val Thr Val 

370 375 380 

ACT GCC GCC TTC CGC TCG GGC CGC CAG AGG GCG CTG TCG GCT AAG GCC 1202 
Thr Ala Ala Phe Arg Ser Gly Arg Gin Arg Ala Leu Ser Ala Lys Ala 
385 390 395 

TGT ACG GCA TCT GGC GAG CGG ATC CGT GTC CCG CAG GCC ATG CGG CCG 1250 
Cys Thr Ala Ser Gly Glu Arg He Arg Val Pro Gin Ala Met Arg Pro 
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400 405 410 

GAG GCC GGA CTG CGG GAG CCC TGACCTTCCT GCCTCGTCCA CCCGAGGGGC 1301 
Glu Ala Gly Leu Arg Glu Pro 
415 420 
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CCCAGGACCT 


GCACTGGGCC 


CCGCTATCAG 


TGCGGGGGGG 


GGGGTGCAGA 


GTCTTCACAG 


1961 


GAATGGGGGA 


TGAGACCTTG 


GCATGTAGTA 


CATTGGGGAT 


AGGAGAGCCC 


TGCCGTGACA 


2021 


GACTTACAGG 


GAGTCTCCTG 


CTTTAGTGTA 


GGGAGCAAGG 


TGACATGCAG 


GTGGGCTACC 


2081 


TCCTGTCATC 


ACTACTGCCC 


TGGGGCATCT 


GACAGATACC 


TAAGGGTAGT 


CAGGAACAGG 


2141 


CTTCCTCTCC 


AGTCCCTATG 


TACGCAGAGC 


CCCTCCTCCC 


CAGAACCTCT 


TGCCCTAACC 


2201 


TAAGCTTACT 


CCATCTCTCT 


TCCCCACTAA 


TGACCCAGAC 


TCTAACAATA 


ATACAGTAAG 


2261 


CCAGATGTAA 


ACTGTGAAGT 


CGAC 








2285 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 421 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Pro Ala Thr Pro Gly Ala Met Leu Phe Trp Thr Val Leu Ser Met Ala 
15 10 15 

Leu Ser Leu Arg Leu Ala Leu Ala Gin Ser Gly He Glu Arg Gly Pro 

20 25 30 

Thr Ala Ser Ala Pro Gin Gly Asp Leu Leu Phe Leu Leu Asp Ser Ser 
35 40 45 

Ala Ser Val Ser His Tyr Glu Phe Ser Arg Val Arg Glu Phe Val Gly 
50 55 60 

- 135 - 
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Gin Leu Val Ala Thr Met Pro Phe Gly Pro Gly Ala Leu Arg Ala Ser 
65 70 75 80 

Leu Val Hie Val Gly Ser Arg Pro His Thr Glu Phe Thr Phe Asp Gin 

85 90 95 

Tyr Ser Ser Gly Gin Ala He Gin Asp Ala Val Arg Val Ala Pro Gin 

100 105 110 

Arg Met Gly Asp Thr Asn Thr Gly Leu Ala Leu Ala Tyr Ala Lys Glu 
115 120 125 

Gin Leu Phe Ala Glu Glu Ala Gly Ala Arg Leui Gly Val Pro Lys Val 
130 135 i 140 

Leu Val Trp Val Thr Asp Gly Ala Ser Ser Asp Ser Val Gly Pro Pro 
145 150 155 160 

Met Gin Glu Leu Lys Asp Leu Gly Val Thr He Phe He Val Ser Thr 

165 170 175 

Gly Arg Gly Asn Leu Leu Glu Leu Leu Ala Ala Ala Ser Ala Pro Ala 

180 185 190 

Glu Lys His Leu His Phe Val Asp Val Asp Asp Leu Pro He lie Ala 

195 200 205 

i 

Arg Glu Leu Arg Gly Ala He He Asp Ala Met Gin Pro His Gin Leu 
210 215 220 

His Ala Ser Glu He Leu Ser Asn Gly Phe Arg Leu Ser Trp Pro Pro 
225 230 235 240 

Leu Leu Thr Ala Asp Ser Gly Tyr Tyr Val Leu Glu Leu Val Pro Ser 

245 250 255 

Gly Lys Leu Ala Ala Thr Arg Arg Gin Gin Leu Pro Gly Asn Ala Thr 

260 265 270 

Ser Trp Thr Trp Thr Asp Leu Asn Pro Asp Thr Asp Tyr Glu Val Ser 
275 280 285 

Leu Leu Pro Glu Ser Asn Val Arg Leu Leu Arg Pro Gin His Leu Arg 
290 295 300 

Val Arg Thr Leu Gin Glu Glu Ala Gly Pro Glu Arg He Val He Ser 
305 310 315 320 

His Thr Arg Pro Arg Ser Leu Arg Val Ser Trp Ala Pro Ala Leu Gly 

325 330 335 

Pro Asp Ser Thr Leu Gly Tyr Leu Val Gin Leu Gly Pro Leu Gin Gly 

340 345 350 

Gly Ser Leu Glu His Val Glu Val Pro Ala Gly Gin Asn Ser Thr Thr 
355 360 365 

He Gin Gly Leu Thr Pro Cys Thr Thr Tyr Leu Val Thr Val Thr Ala 
370 375 380 



Ala Phe Arg Ser Gly Arg Gin Arg Ala Leu Ser Ala Lys Ala Cys Thr 
385 390 395 400 
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Ala Ser Gly Glu Arg He Arg Val Pro Gin Ala Met Arg Pro Glu Ala 

405 410 415 



Gly Leu Arg Glu Pro 

420 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL : NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW061 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 7.. 1422 

(ix) FEATURE: ' 

(A) NAME /KEY : misc_feature 

(B) LOCATION: complement (2089.. 2287) 

(D) OTHER INFORMATION: /label= SAC_24029 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

GTCGAC GCG ACG GTG GCG GAG CAG TGG GTG CTG GTG GAG ATG GTG CAG 48 
Ala Thr Val Ala Glu Gin Trp Val Leu Val Glu Met Val Gin 
1 5 10 

GCG CTG TAC GAG GCT CCA GCA TAC CAT CTA ATT CTG GAA GGA ATC CTC 96 
Ala Leu Tyr Glu Ala Pro Ala Tyr His Leu He Leu Glu Gly He Leu 
15 20 25 30 

ATA CTT TGG ATA ATC AGA CTC GTT TTC TCT AAA ACT TAC AAG TTG CAG 144 
He Leu Trp He He Arg Leu Val Phe Ser Lys Thr Tyr Lys Leu Gin 

35 40 45 

GAG CGT TCT GAC CTT AC A GCC AAG GAA AAG GAA GAA CTG ATT GAA GAA 192 
Glu Arg Ser Asp Leu Thr Ala Lys Glu Lys Glu Glu Leu He Glu Glu 

50 55 60 

TGG CAG CCA GAG CCC CTC GTC CCC CCG GTC TCC AGG AAC CAT CCT GCT 240 
Trp Gin Pro Glu Pro Leu Val Pro Pro Val Ser Arg Asn His Pro Ala 
65 70 75 

CTC AAC TAC AAC ATC GTC TCC GGC CCT CCA ACC CAT AAC ATC GTG GTG 288 
Leu Asn Tyr Asn He Val Ser Gly Pro Pro Thr His Asn He Val Val 
80 85 90 

AAC GGA AAA GAG TGT GTC AAC TTT GCC TCC TTT AAT TTC CTC GGG TTG 336 
Asn Gly Lys Glu Cys Val Asn Phe Ala Ser Phe Asn Phe Leu Gly Leu 
95 100 105 110 
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CTG GCC AAC CCT CGG GTT AAG GCT GCG GCT TTT GCA TCA TTA AAG AAG 384 
Leu Ala Asn Pro Arg Val Lys Ala Ala Ala Phe Ala Ser Leu Lys Lys 

115 120 125 

TAT GGC GTG GGT ACC TGT GGT CCC AGA GGG TTT TAT GGC ACA TTT GAT 432 
Tyr Gly Val Gly Thr Cys Gly Pro Arg Gly Phe Tyr Gly Thr Phe Asp 

130 135 140 

GTC CAT CTG GAT TTG GAA GAG CGC CTG GCA AAG TTT ATG AAG ACC GAA 480 
Val His Leu Asp Leu Glu Glu Arg Leu Ala Lys Phe Met Lys Thr Glu 
145 150 155 

GAA GCT ATC ATT TAC TCC TAT GGC TTC TCC ACC ATA GCC AGT GCG ATT 528 
Glu Ala lie lie Tyr Ser Tyr Gly Phe Ser Thr He Ala Ser Ala He 
160 165 170 

CCT GCG TAC TCT AAG AGA GGG GAC ATT GTT TTT GTG GAC AGT GCA GCC 576 
Pro Ala Tyr Ser Lys Arg Gly Asp He Val Phe Val Asp Ser Ala Ala 
175 180 185 190 

TGC TTT GCT ATC CAG AAA GGA CTA CAG GCA TCA CGC AGT GAC ATT AAG 624 
Cys Phe Ala He Gin Lys Gly Leu Gin Ala Ser Arg Ser Asp He Lys 

195 200 205 

TTA TTC AAG CAC AAT GAC GTA GCT GAC CTG GAG CGA CTG CTA AAA GAA 672 
Leu Phe Lys His Asn Asp Val Ala Asp Leu Glu Arg Leu Leu Lys Glu 

210 215 220 

CAA GAG ATT GAA GAT CAA AAG AAT CCT CGA AAG GCC CGT GTG ACT CGG 720 
Gin Glu He Glu Asp Gin Lys Asn Pro Arg Lys Ala Arg Val Thr Arg 
225 230 235 

CGA TTC ATC GTC GCG GAA GGA TTG TAT ATG AAC ACT GGA ACC ATC TGC 768 
Arg Phe He Val Ala Glu Gly Leu Tyr Met Asn Thr Gly Thr He Cys 
240 245 250 

CCT CTT CCA GAA CTG GTG AGG TTA AAG TAT AAA TAC AAA GCA AGG ATC 816 
Pro Leu Pro Glu Leu Val Arg Leu Lys Tyr Lys Tyr Lys Ala Arg He 
255 260 265 270 

TTC CTG GAG GAG AGC CTG TCG TTC GGA GTC CTT GGG GAG CAT GGG CGA 864 
Phe Leu Glu Glu Ser Leu Ser Phe Gly Val Leu Gly Glu His Gly Arg 

275 280 285 

GGA GTC ACC GAG CAC TAT GGG ATC AGT ATT GAC GAC ATC GAC CTT ATC 912 
Gly Val Thr Glu His Tyr Gly He Ser He Asp Asp He Asp Leu He 

290 295 300 

AGT GCT AAC ATG GAG AAT GCG CTC GCT TCT GTT GGG GGC TTC TGC TGT 960 
Ser Ala Asn Met Glu Asn Ala Leu Ala Ser Val Gly Gly Phe Cys Cys 
305 310 315 

GGC CGC TCT TTC GTG GTT GAC CAT CAG CGG CTC TCT GGC CAA GGA TAC 1008 
Gly Arg Ser Phe Val Val Asp His Gin Arg Leu Ser Gly Gin Gly Tyr 
320 325 330 

TGC TTT TCA GCT TCA CTG CCC CCC CTG CTA GCC GCT GCC GCC ATT GAG 1056 
Cys Phe Ser Ala Ser Leu Pro Pro Leu Leu Ala Ala Ala Ala He Glu 
335 340 345 350 

GCC CTC AAC ATC ATG GAG GAG AAC CCA GGG ATT TTT GCA GTT TTA AAG 1104 
Ala Leu Asn He Met Glu Glu Asn Pro Gly He Phe Ala Val Leu Lys 

355 360 365 
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AAA AAG TGC CAG ACC ATC CAC AAG TCC CTA CAA GGG GTT TCC GGT TTA 1152 
Lys Lys Cys Gin Thr lie His Lys Ser Leu Gin Gly Val Ser Gly Leu 

370 375 380 

AAA GTG GTG GGG GAG TCC CTT TGC CCA GCG CTT CAC CTC CAG CTG GAA 1200 
Lys Val Val Gly Glu Ser Leu Cys Pro Ala Leu His Leu Gin Leu Glu 
385 390 395 

GAG AGC ACG GGC TCT CGG GAG AGA GAT ATG AAG CTG CTT CAG GAG ATT 1248 
Glu Ser Thr Gly Ser Arg Glu Arg Asp Met Lys Leu Leu Gin Glu lie 
400 405 410 

GTA GAG CAA TGC ATG AAT AAG GGC ATC GCA TTG ACT CAG GCA CGC TAC 1296 
Val Glu Gin Cys Met Asn Lys Gly He Ala Leu Thr Gin Ala Arg Tyr 
415 420 425 430 

TTG GAC AAG GAA GAG AAG TGC CTT CCT CCT CCA AGC ATC AGG GTT GTG 1344 
Leu Asp Lys Glu Glu Lys Cys Leu Pro Pro Pro Ser He Arg Val Val 

435 440 445 

GTC ACC GTG GAG CAG ACA GAC GAA GAG CTG CAG AGG GCT GCA GCC ACC 1392 
Val Thr Val Glu Gin Thr Asp Glu Glu Leu Gin Arg Ala Ala Ala Thr 

450 455 460 

ATC AGG GAG GCG GCC CAG GCT GTG CTT TTG TAGGCTCCTG CCCAGTGCTT 1442 
He Arg Glu Ala Ala Gin Ala Val Leu Leu 
465 470 

GCGACCATGT CACCTGCAGA CAGAACTACT CAGACCTCCA GCTGCCCAGG GCGTGGAGTG 1502 

CTGCCCAGGG CGTGGCTGCC CTTGAGTCCA CAGCCAGAAT GGATGGCTTT ACCCAGCAGG 1562 

CTTCCAGAGG ATGGCAGACG ACGGTGTAAC ACTGGTGTTT ATGCGGTGCC ATGGCTCTGT 1622 

CTCTTATACT CGCCTTTGTT TAAAACTGGA GCTTCCGTTT TCCCTCTCCC GCCTGTGGGA 1682 

ACAATTCGAT AAGCACTCCT CTGTATTCCA TACAAAAGGA CTTGCAGCTG AGAGGAAGTC 1742 

TCCACGCACC ACAGTTACTG TGGGATGATA GACTGTGGCC TCCCAGGTCC TTCGGGTAGA 1802 

GCTCAGTGGT GGCTGCTGTG CAGACAACTC CTGTGTGTCA AAACTGTCAT AGAACATGTT 1862 

GATCCTAGAA CTCTGTGTGC TCCAGTTACT GATGCTGTCG TGTCTAATCT GAGCTTCACT 1922 

AACAATAGTC ATACCGCTCC CTAAGCAATA ATCTGAATCA AACATTATTT TATTTTTGTG 1982 

TAATTGACTT TATATTTTTT ATATATTCTC TAGTCAGTTT TGGGAAGAGG GTTATTATTC 2042 

TGGTCCCCAA ATATATAAAT CATATCCTTG GTATTTTTTA ATATGTACGG TGTCCTTTTT 2102 

CATGGTGGTT TTGTCCAGGA ACATAGAACT GTGGGCCTTT TCTAAGCTGA GAAAAGAGAG 2162 

AACTTTCTTC TTTCGTTCTA TAAATTCCAG ATGCCTCAGA AAATAGGAAT GCTCTGAAAA 2222 

CTGCTGCTGT CCTTAGCGAG CGTGCTGAGG GGTGTAACGG CTTGTGACAT GTGCCGCGAT 2282 

CGTGTACATG GAAGGCCATT ACTCCCTCTC GCTGCACAGG CAGAGGAATG GGGCCTCTAA 2342 

CCGTCGTTTT CGGATGTTCA TTCATGTTAA CAGTGGACGG CATCTTCCTA AGTGATCTGT 2402 

GACATGTTTA ATGTGTCTAT TTCATACGTG CCTTGTGAAT GCTGCTGCTG TGAGGGTACG 2462 
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GGGATGAGTC TGTTTTGGTT TGGTTTTTAA TGAAATAAAC CTCCAAGAGC CTACAAAAA 2521 

(2) INFORMATION FOR SEQ ID NO: 58; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 472 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Ala Thr Val Ala Glu Gin Trp Val Leu Val Glu Met Val Gin Ala Leu 
15 10 15 

Tyr Glu Ala Pro Ala 1 Tyr His Leu lie Leu Glu Gly lie Leu lie Leu 

20 • ' 25 30 

Trp He He Arg Leu Val Phe Ser Lys Thr Tyr Lys Leu Gin Glu Arg 
35 40 45 

Ser Asp Leu Thr Ala Lys Glu Lys Glu Glu Leu He Glu Glu Trp Gin 
50 55 - 60 

Pro Glu Pro Leu Val Pro Pro Val Ser Arg Asn His 1 Pro Ala Leu Asn 
65 i 70 75 80 

Tyr Asn He Val Ser Gly Pro Pro Thr His Asn He Val Val Asn Gly 

85 90 95 

Lys Glu Cys Val Asn Phe Ala Ser Phe Asn Phe Leu Gly Leu Leu Ala 

100 105 110 

Asn Pro Arg Val Lys Ala Ala Ala Phe Ala Ser Leu Lys Lys Tyr Gly 
115 120 125 

Val Gly Thr Cys Gly Pro Arg Gly Phe Tyr Gly Thr Phe Asp Val His 

130 135 140 

Leu Asp Leu Glu Glu Arg Leu Ala Lys Phe Met Lys Thr Glu Glu Ala 
145 150 155 160 

He He Tyr Ser Tyr Gly Phe Ser Thr He Ala Ser Ala He Pro Ala 

165 170 175 

Tyr Ser Lys Arg Gly Asp He Val Phe Val Asp Ser Ala Ala Cys Phe 

180 185 190 

Ala He Gin Lys Gly Leu Gin Ala Ser Arg Ser Asp He Lys Leu Phe 
195 200 205 

Lys His Asn Asp Val Ala Asp Leu Glu Arg Leu Leu Lys Glu Gin Glu 
210 215 220 

He Glu Asp Gin Lys Asn Pro Arg Lys Ala Arg Val Thr Arg Arg Phe 
225 230 235 240 

He Val Ala Glu Gly Leu Tyr Met Asn Thr Gly Thr He Cys Pro Leu 

245 250 255 

Pro Glu Leu Val Arg Leu Lys Tyr Lys Tyr Lys Ala Arg He Phe Leu 
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260 265 270 

Glu Glu Ser Leu Ser Phe Gly Val Leu Gly Glu His Gly Arg Gly Val 
275 280 285 

* 

Thr Glu His Tyr Gly lie Ser lie Asp Asp lie Asp Leu lie Ser Ala 
290 295 300 

Asn Met Glu Asn Ala Leu Ala Ser Val Gly Gly Phe Cys Cys Gly Arg 
305 310 315 320 

Ser Phe Val Val Asp His Gin Arg Leu Ser Gly Gin Gly Tyr Cys Phe 

325 330 335 

Ser Ala Ser Leu Pro Pro Leu Leu Ala Ala Ala Ala lie Glu Ala Leu 

340 345 350 

Asn He Met Glu Glu Asn Pro Gly He Phe Ala Val Leu Lys Lys Lys 
355 360 365 

Cys Gin Thr He His Lys Ser Leu Gin Gly Val Ser Gly Leu Lys Val 
370 375 1 380 

Val Gly Glu Ser Leu Cys Pro Ala Leu His Leu Gin Leu Glu Glu Ser 
385 390 395 400 

Thr Gly Ser Arg Glu Arg Asp Met Lys Leu Leu Gin Glu He Val Glu 

405 410 415 

Gin Cys Met Asn Lys Gly He Ala Leu Thr Gin Ala Arg Tyr Leu Asp 

420 425 430 

Lys Glu Glu Lys Cys Leu Pro Pro Pro Ser He Arg Val Val Val Thr 
435 440 445 

Val Glu Gin Thr Asp Glu Glu Leu Gin Arg Ala Ala Ala Thr He Arg 
450 455 460 

Glu Ala Ala Gin Ala Val Leu Leu 

465 470 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 598 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW062 

(ix) FEATURE : 

(A) NAME/KEY: misc„f eature 

(B) LOCATION: complement (87.. 223) 

(D) OTHER INFORMATION: /label= SAC_24443 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

TTTTTGTGAG CAAATCCAAA TTTATTTTAA TGCCATGTCA TTTCAAAAAC CCCAGCCTTG 60 

GTTCCCTGGA ACAGCATGCC AGAGGTACAA AGTGTAACCT TCCTCTATAA ACCCCCAGCA 120 

AGTCATCCAA GTCCTCAGCT TCAGAAAGTC AGTTCTCTTC AGTCTACTCC TCTGGTTCCT 180 

GGTTTTTCTT TCAAGGAAGG CAACTTAAAA ATTAGGTAGC TGAGGTTCCA ATGTTGGTGG 240 

CTAACCTTGC CTCCTCTTTT GTTCTGTAGT AGGGCCACAA ACCTTGACCA GGCAGAGTAG 300 

TAGAAAAGTA GAAAGAGGGG CTTGACGACG GTGGATTTTG ACTCCTGATT TTATTATTCA 360 

t 

ATTTCTTTTT CTACTAAAAG TAGTCTTCGG TGGTTGGGAA GCCTGGCCTC CCAACACCAG 420 

AGTCAGTCGG AGCTGGTTTT TTTGTTGAAA GGAGTGGGCG GGTGGGTGGG GGACCGGGAT 480 

GAGGGCAGAA CCCCGCTCTG CTGGTAGTCT TGGGTGGAGA AGACGAACTG CACTTGACAG 540 

AGCCTGGGGG TGCGGTGGGA GGGGGTGAGG CAGGAGTGAC AGCTGGGGAG GGGACCCA 598 
(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH; 3305 base pairs 

(B) TYPE: nucleic acid 1 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

<vii) IMMEDIATE SOURCE: 
(B) CLONE: HW066 

(ix) FEATURE: 

(A) NAME /KEY: misc^feature 

(B) LOCATION: 759.. 906 

(D) OTHER INFORMATION: /label* SAC_24521 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

ATTCGCGGCC GGCTCGACTT TTTTTTTTGT GATATAAAAA ATTTATTTTA GAAATCACAT 60 

CTTTAAAAAA CAACATAGAA GGTTATCCTG ACCAGATATA AATTCTTCAT TCAATGCCAG 120 

GGTGTTATGT GGTACAATTA GCGGGATGTT CTGTGAGGAG AATGAGTCTG TGTGATACTC 180 

TCGACATGTA CACCAGCTCA AACGTTAATG CGGCTTTTAA AAGGAAGGAG AGAGGAAGGC 240 

AGCGAGAGAG GAGGAAACTG TTTTGGAAAC AGAAGAGCAT CTCCGTCCTG CACCTGTCAG 300 

CCCTGAAACT CCCGGCCCTT GAGACCAGCC CAGCAGAGGC TGCACGAANN NGCATCATTT 360 

TAATTCACAA GTGACTGCAG CACCCTGGCA TCTTGTGCTG GGAGTGTGGG TGGGAACAGA 420 
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GGTGACAAGT 


CTGCTCTGTA 


GTGTCATAGC 


CAGAGAGCAA 


CAGGGACTCT 


AGAGGAAGAA 


480 


GAGCAAAGCT 


TGCCTAGACA 


GAACGGGTTG 


TCCCTGTCCC 


TGTCCATCAG 


CTTTCTTATA 


540 


ACCTATGGNT 


AGCCAGGATC 


AAATACCTGA 


AAGTCAGAAT 


GTAAAATAAA 


AGTGTCAGTT 


600 


AACACTTATT 


GCGGACAGAA 


ATACAGCTCT 


TTACGCCTAC 


AACCATAAAC 


CCAGTGGTGA 


660 


GGCTGGGACA 


GACCTTTTAT 


GCCGAAGTTC 


TCACAACTCG 


GGACTGACTT 


ACTGGGGCCT 


720 


GGTCACTGCT 


TAATTGGTGA 


CCTCACACCA 


TTTCAGGTAC 


TTCTGCCCAG 


CTCCTGATTC 


780 


CTCTTCAGGA GAAACAGGAA 


GCAGACTTTG 


TTAGTAAGTG 


GGTCATGCAG 


GGCTGGGCCG 


840 


CCTACATGGG 


ACTTAACACA 


GAGAGAATAC 


TGCCAGCTTC 


CCAAACACTT 


CAAGAGAGAA 


900 


AAATGTCCAG 


ACGTGGGTGC 


TGCAGCAACC 


TTCCAAGAGT 


GAGGCCCCTG 


CACCCTCCTG 


960 


CACCCTCGGC 


TGGGGACAGT 


GGGCTCTGGT 


GGCTTGAACT 


GAGTAGATCG 


GATCATTCCC 


1020 


TCCTCTTTCT 


GAGACAAGGC 


TTCACTGTGT AGCCCNNGCT 


GCCCTGGAGC 


TCACTATTTA 


1080 


GACCAGGCTA 


GGATCTCACA 


GAGAACCACC 


TGCCTCTGGC 


CTCTCAAGAG 


CTGGGATTAA 


1140 


GGTGTGAAAA 


GGGGGCTAAT 


GTAGCCCAGG 


CTGGCTCCTG ctccacaggg 


CTTGCAGGCC 


1200 


AGTTCTGGAT 


CCTACCa!gCA 


AGTACAAGCT 


GGTACCCAGA 


GGACGCTGCA 


GGATGGAAGG 


1260 


ACGGAAGGCC 


AGGAGCCTTC 


GTTTGATTTG ACGGGAAGGA GGAGGAACAG 


TGAGGTTGTG 


1320 


CATCTGAGCT 


TTATTACCAG 


GACGATGATG 


TCACCTCAGG 


AAGGGACTGG 


GATCCCTGCA 


1380 


AGTGGTTCCA 


CATGGGACCC 

* 


AGGTCCTCAG 


AGATGTGGCC 


CTCCTTGTGT 


GCCAAAATGC 


1440 


ATCTGGAAAG 


AAAATACAGA 


AGGGGAGGGA 


CAGCATAGGA 


GGAGACAAGT 


GTGACCCGAG 


1500 


GCCCAGGAAG 


TACTTCCATA 


TTCTGTCCTT 


CTCCAGGCAC 


GAGAGTTGAA 


GGCCGTGCCA 


1560 


AGCCCTGCAC 


TCGCTAAGCT 


GGTGAAGACC 


ACACCAAAAT 


CACTGTGACA 


GGACAGCTGC 


1620 


ACAGATCTGT 


TTCCGACTTT 


TCCCACCACG 


ATTCTACACC 


AAGGTAAAAA 


AGAGCTTCCA 


1680 


CCACAAATCC 


AAGGCAGGAG 


CTGGGCGGCC 


ACACACCCGC 


CACAGATGGT 


GCAAGTTGCT 


1740 


CACTTCCGCA 


GCCGGTGGCA 


GGACAGGGAG 


GAGGGCAGAG 


TCCTGAGACA 


AGAACCCCAG 


1800 


AGCGGGGCCA 


GTTCAGTCAG 


TGGTGGACCT 


CCGGTACCAG 


AACCGAGCTC 


GGAAGTCGTC 


1860 


GCTGCACGTC 


ATGAAGCCAG 


GGTTGGTCGG 


GGAGTGCACT 


ATGCAGCGCA 


CAATGTTGTT 


1920 


GTGGCCCAGG 


GACAGCAGGT 


TCCTGCGCTC 


AGCTGTCCTA 


GAGTCCCAGC 






GATGGTCCTT 


TCGTCCGGCA 


GCAAAATATA 


GTCCTCAGTG 


TGGTTGAACA 


CAGCCTGTGT 


2040 


CCGGGTGCAC 


TTGCCGTCCA 


CTCAGCCCTG 


CGCCTGTGTA 


CCTCACCAGT 


GTCCGTCCCG 


2100 


TGGATATTTC 


CCCAGAGTTT 


AGCTACAGAG 


TCTTTYCCNN 


GAGTTTAGGC 


CTACAGAGTC 


2160 


TTTTCCACTT 


GAGAGGATGT 


ACTTGGAATT 


CTTGGAGAAA 


ATGGCAGAAC 


AGACTTCTGC 


2220 


CCCGTCGTGA 


GCTTTCTCAA 


AGGTTGTGAT 


GCAGCGATTG 


GAGACGCCGT 


CCCACAGCTT 


2280 
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GATGCAGCCG 


TCCTTGCTGC 


CAGTGACATA 


CATGTTGGCA 


CTAGGGTTAT 


AGTTGACAGA 


2340 


GCAGATGGCG 


TCGGTGTGCT 


GGTCTTGGGG 


ATTGCAGGAG 


ACGAAACACT 


GGAAGGTGTT 


2400 


GATGTCATAG 


AGGCGAAGCG 


TGGGATGCTG 


GGTCCCGACC 


AGGATGAAGT 


CTCCAGAAGG 


2460 


GTGGAACGAG 


ATGGAGCGCA 


GCATCTCAGC 


TTCCTGAATG 


TATTTAAAGG 


CTCTTTTTGC 


2520 


AGAGGGTTTG 


GAATAATCAA 


ACAATTTTAG 


AGTATAATCC 


CTTGAGCCGG 


AGGCCAGGAT 


2560 


CTGTTCTGTT 


GGGTGAAAAG 


CGAGACACGT 


GACTTCATCC 


ACATGGTCGT 


AAAGAGTTCG 


2640 


GATCACTGGG 


TGGTTCTCCA 


TGTTCTGTTG 


TGCAGTCTCA 


TTCATCATGA 


CCCTCGATGG 


2700 


GCATGGCGCT 


CTTGGCCAGC 


ATCCTCTCTG 


TGTCCAGTAT 


CTTGATGGAA 

i 


GCATCCGCAG 


2760 


ACCCTGTGGC 


TATCAACTGC 


CCATCTCTGC 


TGTAGGTGGC 


CACTCGGCAT 

! 


GGGCCTTTGT 


2820 


GGGAGGTGAC 


ATAACAGGTT 


TCATACTCCG 


AAGCCTCGGG 


TGACATTGTT 


TGGACGTCTG 


2880 


CATCAAACTC 


CAGGTCAATN 


CTGTGCCAGG 


GGCAACCGTG 


TCAGAGCGAC 


CAATGGCATA 


2940 


CTGAACTGCA 


GTGTCATCGT 


TCTCCATCCC 


TAGCTTGATG 


AGGTGCAGGA 


GCTGCTCCGA 


3000 


GGGCGCACAC 


ACCGACTGTG 


GCTTGATCTC 


GTTGATGAGG 


CCATTGCGAT 


GCTAATGTAT 


3060 


CCGTCATAGA 


GCAGCTGGCT 


GATGATCAGC 


TTGTAGAGCT 


GCTGGCGGTC 


CTTCAGGCCC 


3120 


ACTTTGGTTC 


TGTACATTCT 


GCACAAGAGG 


ATGACAGTTC 


CCCTGACTTC 


TCCGAGCTGA 


3180 


TCACCTCCGC 


GACAGCCTTG 


AATCCAAAGG 


ACTGTGAGTG ACCGGCAAAC GCCCGANCCT 


3240 


TGGAGAGCAC 


NGCGGGCGAT 


AGAACGGCCG 


TAGATCGGAC 


AGGCCGTTGG 


TGCACCGTCG 


3300 


ACCGG 












3305 


(2) INFORMATION FOR SEQ ID NO: 61 


• 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2099 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW069 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1360.. 1893 

(ix) FEATURE: 

(A) NAME /KEY: misc„£eature 

(B) LOCATION: 1744.. 1885 

(D) OTHER INFORMATION: /label= SAC__24477 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:61: 
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GTCGACGCAT CCAGGCAGGC ATGGAGCAGG TAGGGCTCAG AGTTCTACAT CTTCATCTGG 60 

AGGCTACTAT GAGGAGACCG ACTTCCAGGC AGCTAGGAAG CCCATGCTCA CAGTGACAGA 120 

CCTTCTCCAA CAAGGTTATA CCTCCTAATC GTGCCGCTCC CTGGGCCAAG CATATGCAAA 180 

CCTTCACACA GGTCTTCCTC CTACACAGTG GGGCTTCTGG GCCCCACTCA CGGCCAGTAC 240 

ATGGCAAAAG ACATTAAAGA TGCACTTGGA CCAGCTCAGG GCCCTGTTCT GCAGCCAGGT 300 

ATCATGGATG GAGGAAGTCC TGACTGAAAC AAAAATGTCT GTCCGCATTG TCATCTTTGA 360 

GAGCCTGAAC AAACCCCGAA CACCTCTGAC CTGGCACAGG GACCCTGAGT CTGCCTGCCT 420 

GGCACTCTGA ACTTGCTGGC TACTGGGCTG AAGTCTCAGG ATCGAGGTAC AAGGACAAAG 480 

CCTGTTGGTA TTATGTAGAC AGGAGCCCCA GGGGGCTCAG AGCAGGGGTT TACCGTTACT 540 

CTTGGTCCAG CCGGCACAGT GGTCTTCCCT CAGTTTTTGT TTTTGTTTTT TTGTTTTTTT 600 

GTTTTGTTTT TTGTTTTTTG TTTGTTTGTT TGTTTTGTTT TpTTTTTTGG TGCTGTTGAC 660 

CCTGCAGTCT GGACTGCCCC CTTGGGCAAT ACATTTTGAT GGAAACCAAC TTCTGTGTTC 720 

CCAGATCGGC AGCCTCTCCA TGCCGAGCAC CCCTGTTCTA GCTAAGCCGG ACATGCTACA 780 

CCCCTGCCGT GGTTGGTTCA TCTCCCCACT CCCCGCCGTG TCTTAATAAC ACGGTGGATG 840 

GATGGACAGA CGGACAGATG AAGCTTTGAC AGCTAGCCAC TGCTCTGCTC CTTTTCTCTT 900 

GTCTTCTGAG TAAATTCTCA AACTCAAGCC TCCCAGGGGG AGCCCAGAAC CCCACCCACC 960 

ACTAGTGAAA AAGCAGCTTT TTAGGCAGCT CAGAGAGTGT CCGGAGCAGG GTTCCTACAG 1020 

GCTCAGGTTG GGGGCAGTTT CAGTGTGTGG CTGCCGGCCC AGGGCCCTCC CTGGTCAGCA 1080 

CCTAGGTAGA GAGGAAGCAG GAAGGACCAA GTCCAGCTCA AAGGAACCAG TGAGGCTGGT 1140 

TGAGAGCAGA CGGTGGGTAT GGGTCTGTGA AAAGAAAAGT AGGCCGTTGG GAGCCCACCG 1200 

GAGGACAGTC TGGCTCAAGG CATCACACCT GGTTGCAGCA GCCTCCGAGA CAGCAGCCCG 1260 

TGGGTGTGAA CTTGGCTCCA TCTTGGCTGA ATTCTGCTGC CCCAGAGAAG AATCTGGCAC 1320 

CGGAGTGGAC CCAGATACGT GTCACAGCCT TGGAAATCG ATG GCA GCG AGA CCC 1374 

Met Ala Ala Arg Pro 
1 5 

TTT ATT GGG CAC CTA GAG ATT TCC GAG AGG CGG TTC CTA ACA TTG TCA 1422 
Phe lie Gly His Leu Glu He Ser Glu Arg Arg Phe Leu Thr Leu Ser 

10 15 20 

TTC CTT TCC CAC AGG CCT CAG GTC CAG GCC CAA AGC CAA GTG ACT GCC 1470 
Phe Leu Ser His Arg Pro Gin Val Gin Ala Gin Ser Gin Val Thr Ala 

25 30 35 

CTC ATC GGC TCA GCA CAT CGG CGG GTG CTG GAT GTG CCG CCT GCA GAG 1518 
Leu He Gly Ser Ala His Arg Arg Val Leu Asp Val Pro Pro Ala Glu 
40 45 50 

GCT CTT CGG GGC CAG GCT GGG ACT GGG CCC CTG AGA AAA CCC AAA CTC 1566 
Ala Leu Arg Gly Gin Ala Gly Thr Gly Pro Leu Arg Lys Pro Lys Leu 
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55 60 65 

TGT GCC TTA CCC AGT CAG GAT CCT TGG GAG CCT TGG CCC TCG ATG CCA 1614 
Cys Ala Leu Pro Ser Gin Asp Pro Trp Glu Pro Trp Pro Ser Met Pro 
70 75 80 85 

GGA CCT GGC CTA GCA TCA CCC TGG GCC ACC TGC ACG TTC ATG GCC TCA 1662 
Gly Pro Gly Leu Ala Ser Pro Trp Ala Thr Cys Thr Phe Met Ala Ser 

90 95 100 

CCA TCT GCT TTG CGA ACC ACT ACC CAC CCC CTG TCG CAT GCA CCT TCC 1710 
Pro Ser Ala Leu Arg Thr Thr Thr His Pro Leu Ser His Ala Pro Ser 

105 110 115 

i 

ACC CCT GCG GTG TCT GAC TCT. AGG GAG ATA GGT ACA GAT GTC TGT GAT 1758 
Thr Pro Ala Val Ser Asp Ser Arg Glu lie Gly Thr Asp Val Cys Asp 
120 125 130 

GGG CTG AGA CAT CCC CCA CCC CCC ACG GCT CAT TGG TTT CTT TTA GCT 1806 
Gly Leu Arg His Pro Pro Pro Pro Thr Ala His Trp Phe Leu Leu Ala 
135 140 145 

GAG TGC TGT TTT CTG TGT CTT TAC TCT GCT ATG ATG ACT TGG GGG GTT 1854 
Glu Cys Cys Phe Leu Cys Leu Tyr Ser Ala Met Met Thr Trp Gly Val 
150 155 160 165 

AGT GAG GGC CTG AGG GAT GCT TGT TGG AAG TAC TGT GGG TAGGGCCTGG 1903 
Ser Glu Gly Leu Arg Asp Ala Cys Trp Lys Tyr Cys Gly 

170 175 

TAGGTCCTTA GGGAACAAAC AAACAAAAGG CTCATCTGCT CCTGGGTGGT GACCTGGCCA 1963 

CAGCACCTGT CATCTCCCTT GGTGTCTGCC CAGTGGGGAC GAGCTTCCCT GGCTCAACCC 2023 

CTATCAGGTG TGCACTGTTG AGTCTGCGGG CCAGCCGGGA CTTTAACCAA TAAAGAGCAA 2083 

CCTTGTCTGT CAAAAA 2099 



(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

Met Ala Ala Arg Pro Phe lie Gly His Leu Glu lie Ser Glu Arg Arg 
15 10 15 

Phe Leu Thr Leu Ser Phe Leu Ser His Arg Pro Gin Val Gin Ala Gin 

20 25 30 

Ser Gin Val Thr Ala Leu lie Gly Ser Ala His Arg Arg Val Leu Asp 
35 40 45 

Val Pro Pro Ala Glu Ala Leu Arg Gly Gin Ala Gly Thr Gly Pro Leu 
50 55 60 

Arg Lys Pro Lys Leu Cys Ala Leu Pro Ser Gin Asp Pro Trp Glu Pro 
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65 70 75 80 

Trp Pro Ser Met Pro Gly Pro Gly Leu Ala Ser Pro Trp Ala Thr Cys 

85 90 95 

Thr Phe Met Ala Ser Pro Ser Ala Leu Arg Thr Thr Thr His Pro Leu 

100 105 110 

Ser His Ala Pro Ser Thr Pro Ala Val Ser Asp Ser Arg Glu lie Gly 
115 120 125 

|l Thr Asp Val Cys Asp Gly Leu Arg His Pro Pro Pro Pro Thr Ala His 
130 135 140 

Trp Phe Leu Leu Ala Glu Cys Cys Phe Leu Cys Leu Tyr Ser Ala Met 
145 150 155 160 

Met Thr Trp Gly Val, Ser Glu Gly Leu Arg Asp Ala Cys Trp Lys Tyr 

165 170 175 

Cys Gly 



(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1210 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE : HW070 

(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION: complement (639.. 928) 

(D) OTHER INFORMATION: /label* SAC_24286 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

GTCGACGCAG ATTTTGATGG CGTCCTTTAT CATATTTCAA ATCCTAATGG AGACAAAACA 60 

AAAGTGATGG TCAGTATTTC TTTGAAATTC TACAAGGAAC TTCAGGCACA CGGAGCTGAT 120 

GAGTTACTAA AGAGAGTATA CGGAAGTTTC TTGGTGAACC CAGAATCAGG ATACAATGTG 180 

TCTTTGCTAT ATGACCTGGA AAATCTACCT GCATCCAAGG ATTCTATTGT GCATCAGGCT 240 

GGCATGTTGA AGCGAAATTG TTTTGCCTCT GTGTTTGAGA AATACTTCCA ATTCCAAGAA 300 

GAGGGCAAAG AAGGAGAGAA CAGAGCAGTT ATCCATTATA GAGATGATGA GACCATGTAT 360 

GTGGAATCTA AAAAAGACAG AGTCACAGTA GTCTTCAGCA CAGTTTTTAA GGATGACGAC 420 
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\Jf\ 1U X TjrfJf X Cri 


X x uoxirlAuu x 


omrnp a rpnp A n 
Oi ILAluLnu 


GAGTTCAAAG AAGGACGAAG 


AGCCAGCCAC 




nv«nwl> V* ^» v„ v* 


Afifi f PPP f T w P T T w P 
rtVJVJ X V-V. X X X X 


TAftPPAPAfin 


GAACCTCCTC TAGAACTGAA AGACACAGAT 




X UCLU X UU 


HTftAPAAPAT 


TftfiPTAPATP 
x uuu x ACn x w 


ACCTTTGTGC TGTTCCCTCG 


CCACACCAAT 


goo 


fiPPAPTfiPTP 


fiAGAPAAPAP 


PATPAAPPTR 


ATCCACACGT TCCGGGACTA 


TCTGCACTAC 




PAPATTAAfiT 


OPTPTAAGflP 


PT A T A 'PTP A T 


ACACGAATGC GAGCAAAAAC 


ATCCGACTTC 


720 

f £U 


PTTAAf^TGP 


TPAAPPGTGP 


APGPPPAftAT 


GCCGAGAAAA AAGAAATGAA AACAATCACG 


780 


firSRAARAPTT 


TTTP ATPPPG 


PTAAPTPTTR 
v- x nnu x v» x x vr 


GGAACTGAGG AGGAAGCACT 


GTTGACCGAA 


Ran 




PTTWf APTfi 

1 UL X AVw X VJ 


fiATAATPHTA 


GCCCTTCATG TTGCACCTTC 


TCCAGGTTCT 






XV^V^VJX X X\3\3\3 


x lutni X X X\s 


TACGAGTTTG GCAAATACGC 


TGCAGAAACG 




AGCTGTGCTT 


GCAAGGACTT 


GATAGTTCCT 


AAGAATTAAA AACTCACTTG 


ATCAACTTAA 


1020 


TTCCCTTTTA 


TTTTTCCTCC 


CTCACTCCCC 


TTCCTTTTCC AAGCTGTTTG 


CTTTGCAATA 


1080 


TGTTACTGGT 


AATGAGTTGC 


AGGTAATGCA 


ATCTTAACTT GTTTTCTTCT 


AAGTATTTGA 


1140 


GTTCAAAACT 


CCTGTATCTA 


AAGAAATACG 


GTTGGGGTCA TTAATAAAGA 


AAATCTTTCT 


1200 


ATCTTAAAAA 










1210 



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1844 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW073 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 9.. 1223 

(ix) FEATURE: 

(A) NAME /KEY: misc_£eature 

(B) LOCATION: 598.. 1036 

(D) OTHER INFORMATION: /label* SAC_24456 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

GTCGACCT GCC GGC CGA CAG GGC CGC GGG ATG AGG AAG CGG ACC GAG CCC 50 
Ala Gly Arg Gin Gly Arg Gly Met Arg Lys Arg Thr Glu Pro 
15 10 

GTC ACC TTG GAG CAT GAG CGC TGC GCC GCC TCA GGC TCG TCT TCC TCC 98 
Val Thr Leu Glu His Glu Arg Cys Ala Ala Ser Gly Ser Ser Ser Ser 
15 20 25 30 



-148- 



SUBSTITUTE SHEET fRULE 2fi\ 



WO 98/53071 



PCT/US98/10547 



GGC TCG GCC GCC GCG GCG CTG GAC GCC GAC TGC AGC TTG AAG CAG AAC 146 
Gly Ser Ala Ala Ala Ala Leu Asp Ala Asp Cys Ser Leu Lys Gin Asn 

35 40 45 

CTG CGT CTG GCG GGC AAG GGG ACG GCA GAG CCG CAC AGC GCA TCC GAC 194 
Leu Arg Leu Ala Gly Lys Gly Thr Ala Glu Pro His Ser Ala Ser Asp 

50 55 60 

GCG GGC ATG AAG CGG GCT CTG GGC AGA CGG AAG AGC CTG TGG TTC CGA 242 
Ala Gly Met Lys Arg Ala Leu Gly Arg Arg Lys Ser Leu Trp Phe Arg 
65 70 75 

CTA AGG AAG ATA CTT CTC TGT GTT TTG GGG TTC TAC ATT GCC ATT CCA 290 
Leu Arg Lys He Leu Leu Cys Val Leu Gly Phe Tyr He Ala He Pro 
80 85 1 90 

TTT CTT GTC AAA CTG TGT CCT GGG ATA CAG GCC AAA CTG ATA TTC TTA 338 
Phe Leu Val Lys Leu Cys Pro Gly He Gin Ala Lys Leu He Phe Leu 
95 100 105 110 

AAT TTC GTG AGG GTT CCC TAT TTC ATT GAC TTA AAA AAG CCA CAG GAT 386 
Asn Phe Val Arg Val Pro Tyr Phe He Asp Leu Lys Lys Pro Gin Asp 

115 120 125 

CAA GGT TTG AAT CAC ACC TGC AAT TAC TAC CTC CAG CCC GAG GAT GAT 434 
Gin Gly Leu Asn His Thr Cys Asn Tyr Tyr Leu Gin Pro Glu Asp Asp 

130 135 f 140 

GTC ACT ATT GGA GTC TGG CAC ACC ATT CCC TCT GTC TGG TGG AAG AAT 482 
Val Thr He Gly Val Trp His Thr He Pro Ser Val Trp Trp Lys Asn 
145 150 155 

GCC CAA GGG AAG GAC CAG ATG TGG TAT GAG GAT GCT CTG GCT TCT AAC 530 
Ala Gin Gly Lys Asp Gin Met Trp Tyr Glu Asp Ala Leu Ala Ser Asn 
160 165 170 

CAC CCC ATC ATC CTG TAC CTG CAT GGG AAT GCA GGC ACC AGA GGA GGT 578 
His Pro He He Leu Tyr Leu His Gly Asn Ala Gly Thr Arg Gly Gly 

175 180 185 190 

GAC CAC CGT GTG GAG CTG TAC AAG GTG CTG AGT TCC CTT GGT TAC CAC 626 
Asp His Arg Val Glu Leu Tyr Lys Val Leu Ser Ser Leu Gly Tyr His 

195 200 205 

GTG GTC ACC TTC GAC TAC AGA GGT TGG GGT GAC TCA GTA GGA ACA CCA 674 
Val Val Thr Phe Asp Tyr Arg Gly Trp Gly Asp Ser Val Gly Thr Pro 

210 215 220 

TCA GAG CGA GGC ATG ACA TAT GAT GCA CTC CAT GTT TTT GAC TGG ATC 722 
Ser Glu Arg Gly Met Thr Tyr Asp Ala Leu His Val Phe Asp Trp He 
225 230 235 

AAA GCA AGA AGT GGT GAT AAT CCT GTG TAT ATC TGG GGC CAT TCG CTG 770 
Lys Ala Arg Ser Gly Asp Asn Pro Val Tyr He Trp Gly His Ser Leu 
240 245 250 

GGC ACT GGA GTG GCA ACA AAT CTG GTC CGG CGC CTT TGT GAG CGA GAG 818 
Gly Thr Gly Val Ala Thr Asn Leu Val Arg Arg Leu Cys Glu Arg Glu 
255 260 265 270 

ACG CCA CCA GAT GCC CTT ATA TTG GAG TCT CCG TTC ACA AAT ATT CGT 866 
Thr Pro Pro Asp Ala Leu He Leu Glu Ser Pro Phe Thr Asn He Arg 
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275 280 285 

GAA GAA GCA AAG AGT CAT CCA TTT TCA GTG ATA TAG AGA TAC TTC CCA 914 
Glu Glu Ala Lys Ser His Pro Phe Ser Val lie Tyr Arg Tyr Phe Pro 

290 295 300 

GGC TTT GAC TGG TTC TTC CTC GAC CCC ATT ACA AGC AGT GGA ATT AAA 962 
Gly Phe Asp Trp Phe Phe Leu Asp Pro lie Thr Ser Ser Gly He Lys 
305 310 315 

TTT GCA AAT GAC GAA AAC ATG AAG CAC ATC TCC TGT CCC CTG CTC ATC 1010 
Phe Ala Asn Asp Glu Asn Met Lys His He Ser Cys Pro Leu Leu He 
320 325 330 

TTG CAC GCT GAG GAT GAC CCA GTT GTA CCC TTT CAT CTC GGC AGG AAG 1058 
Leu His Ala Glu Asp Asp Pro Val Val Pro Phe His Leu Gly Arg Lys 
335 340 345 350 

CTA TAC AAT ATT GCT GCG CCA TCC CGA AGT TTC CGA GAC TTC AAA GTC 1106 
Leu Tyr Asn He Ala Ala Pro Ser Arg Ser Phe Arg Asp Phe Lys Val 

355 360 365 

CAG TTC ATC CCC TTT CAC TCA GAC CTT GGC TAC AGA CAT AAG TAC ATC 1154 
Gin Phe He Pro Phe His Ser Asp Leu Gly Tyr Arg His Lys Tyr He 

370 375 380 

TAC AAG AGC CCA GAG CTT CCA CGG ATA CTG AGG GAA TTC CTA GGG AAG 1202 
Tyr Lys Ser Pro Glu 1 Leu Pro Arg He Leu Arg Glu Phe Leu Gly Lys 
385 390 395 



TCG GAA CCA GAG CGC CAG CAC TGAGCCTGGC CCGTGAAGGA GCATGGAGAC 
Ser Glu Pro Glu Arg Gin His 
400 405 


1253 


CCACCTTCCT 


TCCCTTCTCC 


CTGAACAGCA 


GTCTGGCACC 


CAGAAGCTCA 


GAGTGCCACC 


1313 


ACCTGTGGTG 


CTCAGGAGCC 


CAGCCTAGAA 


AGAGGACTCC 


GACACAGCGG 


GCAGAGGCTC 


1373 


CACAGACGGA 


TCTATGAGGA 


AAATACGGTG 


GCAGGCAGGC 


AGGCAGGCGA 


CCCCCTGACC 


1433 


CTCTGGTGGC 


CGCTGTATCT 


GAGCCCTTTT 


GGGAAGGCTT 


ATAGACAACA 


GGTGGAGCCC 


1493 


ATACGCTGGG 


CATAGGGAGC 


CTGGGAAGGG 


CTCAGGAGCT 


CAGGACCACT 


CCAGGCTCTC 


1553 


TAGCACCACC 


GCTTAAAATA 


CAGGAAAAAG 


GTTCTTTCTG 


CCCTTCCTGG 


CGTACACAGA 


1613 


ACAGATTCCA 


AGTGGTTCAA 


TTTGTCCCCT 


ACAGCTCATG 


TACCTGCTTG 


CCTTCCTCAG 


1673 


CTGTCCCTGC 


CTCTCCTGGC 


ATCTGTACAC 


CCACAGTGAG 


GGGCACCTGG 


ACTTGCACTT 


1733 


CCATTCTGCC 


CACCTGTCTG 


TCACCTAACC 


TGGCCGTAGA 


CTGAGCATTT 


ATTTAAGAAT 


1793 


AAAATCTCGG 


TGGTGGTCTA 


TTTGTTTTTT 


TCTACAAAAA 


AAAAAGTCGA 


C 


1844 



(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 405 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

Ala Gly Arg Gin Gly Arg Gly Met Arg Lys Arg Thr Glu Pro Val Thr 
15 10 15 

Leu Glu His Glu Arg Cys Ala Ala Ser Gly Ser Ser Ser Ser Gly Ser 

20 25 30 

Ala Ala Ala Ala Leu Asp Ala Asp Cys Ser Leu Lys Gin Asn Leu Arg 
35 40 45 

Leu Ala Gly Lys Gly Thr Ala Glu Pro His Ser Ala Ser Asp Ala Gly 
50 55 60 

Met Lys Arg Ala Leu Gly Arg Arg Lys Ser Leu Trp Phe Arg Leu Arg 
65 70 75 80 

Lys He Leu Leu Cys Val Leu Gly Phe Tyr He Ala He Pro Phe Leu 

85 90 95 

Val Lys Leu Cys Pro Gly He Gin Ala Lys Leu He Phe Leu Asn Phe 

100 105 110 

Val Arg Val Pro Tyr Phe He Asp Leu Lys Lys Pro Gin Asp Gin Gly 
115 120 125 

Leu Asn His Thr Cys Asn Tyr Tyr Leu Gin Pro Glu Asp Asp Val Thr 
130 135 140 

He Gly Val Trp His Thr He Pro Ser Val Trp Trp Lys Asn Ala Gin 
145 150 155 160 

Gly Lys Asp Gin Met Trp Tyr Glu Asp Ala Leu Ala Ser Asn His Pro 

165 170 175 

He He Leu Tyr Leu His Gly Asn Ala Gly Thr Arg Gly Gly Asp His 

180 185 190 

Arg 'Val Glu Leu Tyr Lys Val Leu Ser Ser Leu Gly Tyr His Val Val 
195 200 205 

Thr Phe Asp Tyr Arg Gly Trp Gly Asp Ser Val Gly Thr Pro Ser Glu 
210 215 220 

Arg Gly Met Thr Tyr Asp Ala Leu His Val Phe Asp Trp He Lys Ala 
225 230 235 240 

Arg Ser Gly Asp Asn Pro Val Tyr He Trp Gly His Ser Leu Gly Thr 

245 250 255 

Gly Val Ala Thr Asn Leu Val Arg Arg Leu Cys Glu Arg Glu Thr Pro 

260 265 270 

Pro Asp Ala Leu He Leu Glu Ser Pro Phe Thr Asn He Arg Glu Glu 
275 280 285 

Ala Lys Ser His Pro Phe Ser Val He Tyr Arg Tyr Phe Pro Gly Phe 
290 295 300 

Asp Trp Phe Phe Leu Asp Pro lie Thr Ser Ser Gly lie Lys Phe Ala 
305 310 315 320 
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Asn Asp Glu Asn Met Lys His lie Ser Cys Pro Leu Leu lie Leu His 

325 330 335 

Ala Glu Asp Asp Pro Val Val Pro Phe His Leu Gly Arg Lys Leu Tyr 

340 345 350 

Asn lie Ala Ala Pro Ser Arg Ser Phe Arg Asp Phe Lys Val Gin Phe 
355 360 365 

He Pro Phe His Ser Asp Leu Gly Tyr Arg His Lys Tyr He Tyr Lys 
370 375 380 

t ( 

Ser Pro Glu Leu Pro Arg He Leu Arg Glu Phe Leu Gly Lys Ser Glu 
385 390 395 400 

Pro Glu Arg Gin His 

405 

(2) INFORMATION FOR SEQ ID NO; 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 855 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE : 
(B) CLONE: HW074 

(ix> FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 7.. 300 

(ix) FEATURE: 

(A) NAME /KEY: misc„feature 

(B) LOCATION: complement (64.. 365) 

(D) OTHER INFORMATION: /label= SAC_24464 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

GTCGAC AGC CCT CGA GGG AGC AGA CGC CGC GGG CCA CTG GCT TCC TCT 48 
Ser Pro Arg Gly Ser Arg Arg Arg Gly Pro Leu Ala Ser Ser 
15 10 

ACT CAG GAA CCA CGG ATG TCA CCG CCC AGT CAG CTG TGT CTC CTC ACC 96 
Thr Gin Glu Pro Arg Met Ser Pro Pro Ser Gin Leu Cys Leu Leu Thr 
15 20 25 30 

ATT GTC GCC CTG ATT CTG CCT AGT GAA GGG CAG ACA CCA GAA AAA CCC 144 
He Val Ala Leu He Leu Pro Ser Glu Gly Gin Thr Pro Glu Lys Pro 

35 40 45 

AGA TCC AGT TTT ACA GCG CAC CAG AGT TCT GTG ACT ACT CAT GTC CCA 192 
Arg Ser Ser Phe Thr Ala His Gin Ser Ser Val Thr Thr His Val Pro 

50 55 60 

GTT CCA GAT CAA ACT AGC CCA GGA GTC CAG ACC ACT CCT CCC ATC TGG 240 
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Val Pro Asp Gin Thr Ser Pro Gly Val Gin Thr Thr Pro Pro lie Trp 
65 70 75 

ACC AGT GAA GCT GCG AAG CCA CAG GAA GCC AGA CAG CAG CCA AAA CCA 288 
Thr Ser Glu Ala Ala Lys Pro Gin Glu Ala Arg Gin Gin Pro Lys Pro 
80 85 90 



AGA CCC AGC AAC TGACCGAAAT GGCCACTGCG AATCCAGTGA CAGATCCAGG 340 
Arg Pro Ser Asn 
95 



GCCACTTACA 


AGCAGCGAGA 


AAGGTACCCC 


GGCACTCTCC 


AGGATCAAAT 


CTCCCAGCCC 


400 


ACCCAAAGGT 


TACATGCCTC 


CATCCTACAT 


TGAGAATCCA 


CTGGATCCCA 


ATGAGAACAG 


460 


CCCCTTCTAC 


TACGACAATA 


CCACCCTCCG 


GAAACGGGGG 


CTGCTGGTGG 


CGGCAGTGCT 


520 


GTTCATTACT GGAATTATCA TCCTCACTAG TGGGAAGTGT 


AGACAGTTCT 


CTCAGTTATG 


580 


CCTGAATCGC 


CACAGGTGAG 


TGGGAGCCAG 


CACCCTGATG 


GGCACCCCAA 


CTGGAGCCGC 


640 


CATACCATAC 


CAGTTCACCA 


CCCCTGCCTC 


CCTCCCTCTG 


CTCCAAGAGC 


CAACAGAGTG 


700 


GTCAACATAA 


ATGGATCCTC 


AAAGGAAGAG 


GCCACCGGAG 


GGAGCCAGGC 


CTAAGGCTAA 


760 


ATGGTCTTCC 


CACCCTGAGG 


AGAGAGGTCT 


CCCCAGGCAC 


TGCTGTGATC 


CTGCCTATCC 


820 


TGTTCAGATA 


AATCCACATG 


GTCTCTCTTC 


AAAAA 






855 



(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

Ser Pro Arg Gly Ser Arg Arg Arg Gly Pro Leu Ala Ser Ser Thr Gin 
1 5 10 15 

Glu Pro Arg Met Ser Pro Pro Ser Gin Leu Cya Leu Leu Thr lie Val 

20 25 30 

Ala Leu lie Leu Pro Ser Glu Gly Gin Thr Pro Glu Lys Pro Arg Ser 
35 40 45 

Ser Phe Thr Ala His Gin Ser Ser Val Thr Thr His Val Pro Val Pro 
50 55 60 

Asp Gin Thr Ser Pro Gly Val Gin Thr Thr Pro Pro lie Trp Thr Ser 
65 70 75 80 

Glu Ala Ala Lys Pro Gin Glu Ala Arg Gin Gin Pro Lys Pro Arg Pro 

85 90 95 

Ser Asn 



(2) INFORMATION FOR SEQ ID NO: 68: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 4315 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW075 

<ix) FEATURE: j 

(A) NAME /KEY: misc_£ eature 

(B) LOCATION: complement (37B5 . . 4014) 
(D) OTHER INFORMATION: /labels SAC_24466 



ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 








GTCGACATGG 


CGAAGTACTC 


TAGCCTGGAA 


GCCAAGCTCT 


GCCAGGTTGA 


AAGTAAATAC 


60 


TTGATACTGC 


TCCAAGAAAT 


GAAGACACCA 


GTTTGCTCAG 


AAGAACAGGG 


CCCTGCCAGG 


120 


GATGTCATAG 


CCCAGTTGTT 


GGAGGACGCT 


CTGCAGGTTG 


AGAGCCAAGA 


GCAGCCAGAG 


180 


CAAGCGTTTG 


TCAAGCCTCA 


TCTGGTCAGT 


GAGTTCGATA TCTACGGGTT TAGGACTGTC 


240 


CCTGACGATG 


IV m/""» Ti r%fl lint 

ATGAGGAAGA 


GAAATTGGTC 


GCCAAGGTCC 


GAGCATTGGA 


CCTGAAGACT 


O Aft 


CTGTACCTCA 


CAGAAAACCA 


GGAAGTTTCC 


ACTGGGGTCA AGTGGGAAAA 


CTATTTTGCA 


360 


AGCACAATGA 


ACAGGGAGAT 


GGTGTGCTCT 


CCTGAGCTGA AAAACCTGAT 


CCGAGCAGGC 


420 


ATTCCCCATG 


AGCACCGCTC 


CAAGGTGTGG 


AAGTGGTGTG 


TTGACCGTCA 


CACCAGGAAG 


480 


TTCAAGGACA 


GCATGGAGCC 


AGGCTACTTC 


CAGGCCTTAC 


TCCAGAAGGC 


TCTAGAGAAA 


540 


CAGAACCCGG 


CCTCCAAGCA 


GATTGAGCTG 


GACCTGCTTC 


GGACTCTGCC 


CAATAACAAA 


600 


CATTACTCCA 


GCCCCACGTC 


AGAGGGCATA 


CAGAAGCTTC 


GCAATGTCCT 


GCTTGCCTTC 


660 


TCATGGAGGA 


ATCCGGATAT 


TGGCTACTGC 


CAAGGCCTAA ACAGGTTGGT 


GGCAGTGGCG 


720 


CTCCTTTACC 


TGGAACAAGA 


GGATGCTTTC 


TGGTGTCTCG 


TTACCATTGT 


GGAAGTCTTC 


780 


ATGCCTCGAG 


ACTATTACAC 


AAAGACTCTA 


TTAGGATCCC 


AGGTGGACCA 


GCGGGTGTTC 


840 


AGAGACCTCC 


TGAGTGAGAA 


GCTGCCTCGA 


CTGCACACCC 


ATTTTGAGCA 


GTACAAAGTG 


900 


GACTACACCC 


TCATCACCTT 


CAACTGGTTT 


CTGGTGGTAT 


TCGTGGACAG 


CGTTGTCAGC 


960 


GACATCCTCT 


TTAAGATATG 


GGACTCTTTC 


CTTTATGAGG 


GACCAAAGGT 


TATTTTCCGT 


1020 


TTTGCCCTGG 


CACTTTTTAA 


ATACAAGGAA 


GAGGAGATCC 


TGAAGCTGCA 


GGATTCGATG 


1080 


TCCATTTTCA 


AGTATCTCCG 


ATACTTCACT 


CGCACTATCC 


TTGATGCCAG 


GAAGCTGACC 


1140 


AGCATCTCCT 


TTGGGGATCT 


GAACCCCTTC 


CCCCTGCGCC 


AGATCCGGAA 


CCGGAGAGCC 


1200 
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TACCACTTGG AGAAGGTCCG GCTGGAGCTG ACAGAGCTGG AGGCCATTCG AGAGGACTTC 1260 

CTGCGTGAGC GGGACACTAG CCCTGACAAA GGCGAGCTGG TTAGCGATGA GGAGGAAGAC 1320 

ACTTGAATGA ATCCCCACCC CAGGATGCTG CCTTCCTTCT TGCCTTCCTT CATAACCAAA 1380 

AGTAGCAAAG GCGCATCTGC ACATCAGCTG TCAGAGTGAA ACATCTACAG CCTCTGGGCA 1440 

CCTGCGGGGC AGATGTCCAA AGCTAAGCTG CACTTTCTCG TCTGCCATGA GGGCAGGGGT 1500 

CTGATGGCTT TACAGCCATT TCCATGCCGT GGCATTCAGT CATATGCGTC ATATGCTTGC 1560 

TGCCTGATGG GTGAGTCACC AGGCGACTGT TTCCAAGGGC CTGTTTACAT CATCTGCCAG 1620 

TTGTATGTTT CACCTACAGA AACTGTTTGA CATTTCCCTG TGCGCTTTCT TTTTATCCAA 1680 

GGGCAACTCT GTGGAGGAGT GCACATGGAG CTGCAGCGTG GCTGCATCCA GGGGTAAAAG 1740 

GTCACCATAA GAAGAGGGAC CTCTTTAAAA AAAAGCGAGC ACAGAACTGG CCTCCCTAGA 1800 

i 

GAGCGTGGGA GCCCTCTGCT GGTGGCTTCG CACTGCCAGC CACTATCCCT TCTGGACTAA 1860 

CCTTTGCCCT CCTATACAGC CTTTTCAGAT TGTAGCACTC ACGCTGCTCT GCAGCCAGCC 1920 

CCACACTTCT CATTGCCAGC TATTCTGCCA GATGTCTGTC CCAGCTTACA GTGGGGCTAG 1980 

i 

CGCATCTGGG AGGTGACTTA TAGACAACAT TCTCCAACTG TGGCTAGATT GGCGTTCCAA 2040 

AGGAACCTTA CAGTAACCCA CCCCTGCATA AAGGAACACA GGAAACTCCT ATCTGTGGCT 2100 

TGTGGGCTCC TGGATGGCAG TGGCCAGGGC TCTAGAATCT CTTCTGCCCA ACAGCTGGGT 2160 

GGGGAGTAGA GCTGACTGCC AGCTTCCTAC ATCTGGAAGC ACCTTTGCTT GGGAACAAGA 2220 

GTAATGAGTT CTCCTAGTCT CTGTGTTGCC TGGATCATCC TTGTCCTTAA CAGGGATCTG 2280 

AGGACACTGT GGCCTCTGGT ATAGCAGACT CTGGGGACTT GGAACACTTC CTTCTTGAAG 2340 

CCCAGAGAGG GTCTGAACAA AAAAGGTTAT TTTTGCTCAG TCCCAGAGCA TCCCTAATGG 2400 

TGGCTGTTAT GAGAAAACAT CTTCTTACAC AGTATTATGG GAACACAAGG GAAACAGCCT 2460 

CAAGTGGACT ACTTTACACT ACCTAGGGAA AAGAAATTAG AAACAATCCT TAGTGTAAAC 2520 

ACACTCAGAC TGAGGAGTGG AGCAAGACAG TTTCGAGAAA CGCTTTTTCC ATGACCCGTT 2580 

TTCCTGTTTT TGCTGTCTGC TTTCTAGCAC TTGAATGGCT GAGTCCATTT TCAGCATCTA 2640 

AATTTGCTTC ACTTCTCTAA AGTTGCTTCA TATATTACCA TGTAAAAGGA GCTGGTCTCT 2700 

ATATTTAGCC AATAGCCCGT CTCTTTTCAG GTATCAGAAA TGTCATCCTT GTGTCTTTGA 2760 

GATGATGGTT TGAGGCCCGG CTTCTTGAAG AAGCTGTCTG GAGCTGGACA GTGATCTATC 2820 

TCACCACTGT TCCTAGGCCT CCCTGAGACC AAATGCTGGA AGATGCCCCA CCCCCTGTAA 2880 

GATGCCGGCT GTCAGCCTTG GGGACAGCTG TTAGGAGGTT AGAGTTGGAG CCCCATGGCT 2940 

TAGCTCCAGC CGCTCTGATT TCGGTTGTCA TGTCAATTTC CTACCAGAAG ATCCAGTCCA 3000 

AATAAGTATT TGGTGTGGAT GCTCTCAGTG GAGCATTTGG TGTGGATGCT CTCAGTGGTG 3060 

GGTTGCTTTC GCCCTGGCAC CAGTGGAGGC ATCTTTAAGT ACTTGGTGAT GCTGAGGACA 3120 

-155- 



WO 98/53071 



PCT/US98/10547 



GGACCATGTT 


CCTGGCATGC 


GGGGCCCTGG 


GTTCCATCCC 


AGTGATGAAA 


AAAAATCTTC 


3180 


AGTATGTGGG 


GATTGTGGGG 


AGGCGCTGTT 


CTGCCATCCC 


CTGAGAGACA 


GCATTAGTTT 


3240 


CTCAGGCAGT 


TAGTATCAGT 


GCCCAGGTAA AAGTCTCAAG AGGTCCCAAA GGGACTCTCC 


3300 


TCCCTGGAAT 


AAATGACTCG 


ATGTCTGCAG 


AGCGGAGAAG 


GGACGGTTGC 


CCTCTTTCCA 


3360 


GTTTGGGGAC 


AGAATAACAG 


AGATTCATCC 


CAGAGGCACA GCCCCAGTGG 


TTGAGCCATC 


3420 


GTCTCTCATG 


CCCTCGTCAC 


TGAGATTGTC 


TTTAPPPfPP 


ARTAPTPPTR 


AGAPTPPARA 


3480 


CTTCTACCAG 


ACACCATGGC 


AGGTACCCCT 


AARRTPARTR 


Uk* X ULU X l*/tUf 


RPTRPTTTTP 


3540 


ATCTAAACCT 


TTACCCAGCA 


1 

TRPAAPPPAT 




TRRTR ARPPT 


X w X x un 


3600 


GAAACCCAAA 


GCACAGCCTC 

j 


^nUWul, \3 X X3\3 


RTPPAAATRP 


RTV^TRPP A P & 
\J XAJ X V?l«l»xlV*Xl 


RTARRRRARP 


3660 


ACCCTGCCCT 


CAGCACCCCA 


TATRRTRRRA 


PTtfRPPARTA 


TTTV A APT'RT 1 
XXX lul 


x vszi i hv> x x x x 


3720 


ATATTTTCAC 


TACATTTTAT 


nX X X 1 aAXAVS 


AP'TATTI'TAT 


ATTA ARRTVPT 


'PT"T 1 P'T' A R A T W T 1 
1 1 x\- Xx\o/\ X 1 


3780 


CTGTACAACT 


TTTATATAAC 


AARTTPTATHT* 


TTGTGTGCAC 


ATCTCCCCTG 


f A TRf A TTT 1 A 


3640 


TAGCAACCCA 


AGCCTCTCAA 


ATTPPARRTP 


CCTCTTGTGC 


CACCTTTTCC 


PPTTARPPTT 
V*V* X JLAOVrfV. X X 


3900 


GACTGTCACT 


CTCTTCTGCT 


CACCGAGGTA 


GCCTCGGGGG 


CTACTTCTAG 


CCACCATGAC 


3960 


AGCCAGACAG 


CAGGAACTGG 


TGTTTGTCAG 


CTGTGGTCTG 


TTGATACAGG 


AAGTACTCTG , 


4020 


GGCAGTAACC 


CAGTAGCTAT 


GTTACTGGAC 


AAGCCCTTCT 


CGCTTAGTGC 


AGTGATGTCC 


4080 


TGGTGCAGCC 


ACCTGTGGGC 


GGGTGGCTGG 


ACCCATCACG 


AAAGTCCTTG 


AAGACAGAGG 


4140 


CTAGATGAAA 


GCCCAGTGTC 


TGAGGTCCTT 


CTGTGGGCAA 


CTGGAGGACT 


TGAGGGCACA 


4200 


CTGCCAAGTG 


ACTGTCCTTT 


TATATCCTAG 


CAGCTTCTGT 


GCTCACCCTT 


GAGTTAAATG 


4260 


TTAGACTTTT 


AAAGGAAGAA 


ATAAACTAAT 


TTGTATTACA 


TTCTGCCTCG 


AAAAA 


4315 


(2) INFORMATION FOR SEQ ID NO: 69 


• 
* 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1668 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii> HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW076 

(ix) FEATURE: 

(A) NAME /KEY: misc„£eature 

(B) LOCATION: complement (42.. 281) 

(D) OTHER INFORMATION: /label* SAC_24409 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 



GTCGACTTTT GCCTTTAAAA TTCTGTTTAC 


TTAGCATATG TACACGTAAA AGAGAAATCA 


60 


CCCTTAGATC 


CCACCCAGCA 


GAAAACATGC 


ACAAACCCAA GGTATATGTG 


AAAGGGAGGG 


120 


CCCCAGCCCC 


AAGTTAGCAG 


TTGGGATGGT 


GCAATCCTGT GAGTCACAAG 


CCCATGGCAA 


180 


GAGGGCTGGA CTAAAGTTTC AACACAGCCA GAACCCCTGC TGCTGGCCCT AAAATAAGCG 


240 


TGCACCATAG 


CCAAAGGTCT 


GTTTGATGCC 


CTCAAAGTAG TACCGCTGCC 


AGCCTTGCCG 


300 


TGTCCGCTCT 


TCCTCGGGAG 


CAGGGATGCC 


GCGGCCCTCC ATGCACAGCT 

i 


CTGTCTCTCC 


360 


ATTCTTATCA ATGAAGGTCA AGGTGATGGT 


CGCGAAGTGC CCCTCTGGCC 


ATGACTTAAA 


420 


CCTCCACTTC 


ATAGCAATAT 


GTTTCTCAGG 


AACCAAGTCA GTAAACTCCC 


CAGTGACGTT 


480 

*F 


ACCATCGACC 


ATGTGAAACT 


TCCCTCCTCT 


GTCTGCTTCC AAGGCCGCAG 


GAGCATGGGT 


540 


AAAGGCCTGG 


ACCAGCTCCT 


GCGTGGTGAA 


CACTCTGTAG AGCTCCTCTG 


GAGAGGTCAG 


600 


GAAGGTTTCT 


TTGAGGGTGA 


TCTTACAAGT 


GGGTATTTTG ACACCAACAG 


GTTTGGCCTG 


660 

w w v 


GCTTTTTGAA GGAGCACACT 


TAGCCTTGCA 


CACTTCCGTT TTTAGTGCTG 


GTTGGCCCAC 


720 


TGGGTCTACT 


GATTCTCCAT 

i , 


TCACTGTGGG 


CAAGATCATG CCCTGAGTAA 


ACTCTGTCTT 


780 


GAGGGTGCTG 


ATGTAAATTC 


CCACTGCTTC 


TCTGAGAAGT TTCACCCCGT 


CTTCCTTCAT 


840 


TAAGGCCACG 


AGACTTGTGT 


CAGGCTCATC 


TTTGGCAAGG CTGCTTCTTA 


GAGCCCTTGT 


900 

mf W W 


TATGGCAGCG 


CTCCATCTAG 


GAAACAGCAG 


TCAGTCTCTC CTGCTCACTG 


TTCTCCCTCT 


960 

mf W W 


CTGTGCTCAA 


GTCTGTCAGT 


GCTCGCTGGC 


CTCTGGCTCT TCCCTTTGGG 


GAGGGAGCGG 


1020 

JL W m V 


CTGTTCTGCC 


TGCTTCACCG 


AGTCTTCACA 


AGCTGAGGGG ATGAGGCCAG 


GTTAGTACCA 


1080 

■to W w V 


GGGACCAGCA 


ACCTTTCAGG 


GCTCCTTGGG 


GGTCTCAGGT CCACACACAG 


TGAGCATCCC 


1140 

*A. JL 7 V 


TTGCCAGGTG 


CCCCATGTAT 


CTGTCAGTGC 


TGAGACGCCA AACACACAAC 


CTGTTCCCAG 


1200 

J*m M W W 


CTGCTGCTGG 


GGGCCACTGT 


GACCGAGGAG 


AGTGTGGGGT TGCCAGGGAG 


AGCGCTGTCC 


1260 

4»» v V V 


CTCTGACAGC 


CTGACCAGAT 


GATGTAGCTT 


TGAAAGCCGG CAGCATGTGA 


GTAGCAAGAT 


1320 


GTGACCTTGA 


GCTAGCTGAG 


CAGAACACAG 


GTGGATGGGC AGAGGCCCCC 


ATCCTGGCGA 


1380 


GCTGAGCCTG 


GCTGTGTCCT 


GTCAGGACGC 


CCTACTCTGT GACCCTACCC 


TGTGATCTGT 


1440 


GAAGTGGTGC 


AGTCGGAAGA 


CGAGGCTGGA GTCCTTTACA AGTTTCTACA CTAGATTGTA 


1500 


GCCAGGTTTT 


CTCCCAGGCC 


TTAGGCTTTG 


GTTTACTTTT AATTTTAATT 


TCACTGTTAC 


1560 


CCTTGACTAT 


TGTCCTTTTT 


GTTGCAGTGT 


TGGAGAGGCA GGAGGTATTT 


GTGCCTCATC 


1620 


TATTACTGCA AAAAAATTAA 


ATAAATGTAA 


CAATAAACGT CCTAAAAA 




1668 



(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 1019 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW078 

(ix) FEATURE: 

(A) NAME /KEY: misc_feature 

(B) LOCATION: complement (211. .743) 

(D) OTHER INFORMATION: /label= SAC_24345 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 



GTCGACGCCA 


GCGTGAGCGC 


CCTGCAGCGC 


CTGGTGGAGC 


jAGCTGAAGCT 


GGAGGCCGGC 


60 


GTGGAGAGGA 


TCAAGGTCTC 


TCAGGCAGCT 


GCAGAGCTTC 


AGCAGTACTG 


CATACAGAAT 


120 


GCATGCAAGG 


ACGCCCTGCT 


GCTTGGTGTT 


CCGGCTGGAA GCAACCCCTT CCGGGAGCCC 


180 


AGGTCCTGTG 


CTTTACTTTG 


AAGACTGGGA 


AGAAGCTCTC 


TGAGGAACCT 


GTGTGTGCAT 


240 


GGTGATGGAT 


GACTACCGCC 


AAGTCCCAAG 


AAAACAGATT 


TCAAGAGCCA 


AATCATTTTC 


300 


TCTGTGGCCT 


GGTCCTAGGT 


CAAAATTCTA 


CAGAAATCAA 


TGTTTTGTCT 


TATTTTTTAA 


360 


CCTCATTACC 


AAACTCACTG 


GGTGCATCTC 


TCAGTCAAGC 


CTGTGCGGCC 


CTGCCTGGGA 


420 


TGGAGGCATG 


TTTTGTTTAC 


TGGGTATTTG 


CATTTCTGGA 


TGGAAATGTT 


GTATTTCCGA 


480 


AAACTGAGTA 


TTTGTAATTT 


TCCTAACATT 


TTTATACTGT 


AATAAAAAAT 


TTTCATAAAA 


540 


ATTTACAGTT 


ACTAAATTAG 


ACAAAGTATA 


AATTGGGGAA 


TGATTGATAT 


GCTGGTTTCG 


600 


TAAAGGAGGC 


CTTTCCAGTC 


CCAGTGAGCA 


AACGGTCTGA 


CCTGGACGGG 


AAGCTGGGGA 


660 


GCCGCTCCTT 


AGCACACGGA 


GCATGGATGT 


GTTGTTAGGT 


GGCCCTCCCG 


TGTGAAGCCA 


720 


TTAGAACTTT 


CTTTAGTTCC 


TGTACTCTAC 


AACTTATAAT 


TGTGTCTGAT 


GCTTTGAAAA 


780 


GTGTGCTCAT 


GCCTTTAAAA 


ATGTTTAAAT 


GTGATCTACA 


CTTAGAACGT 


TGCAATTGGT 


840 


TAAAAACAAA 


ATATGAAGAT 


ATTATAACCT 


AAAAGAATTC 


ATGTCACACA 


TATTTTATTG 


900 


GGGGTGATGT 


GCCTTTGATT 


TAATTGGGGA 


CACTTTTAGA AGGAAAGTTT 


GTGTTTGTGA 


960 


TTATCTTTGA 


AGAGCTTGGA 


AATAAAATTT 


TTGCCTAATT 


CATCATTTTC 


CATGAAAAA 


1019 


<2) INFORMATION FOR SEQ ID NO: 71 


• 
* 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 961 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL; NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW080 

(ix) FEATURE: 

(A) NAME/KEY: roisc.feature 

(B) LOCATION: 629.. 919 

(D) OTHER INFORMATION: /label* SAC„24033 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 71: 



\3 X VfUAvuU X VJ 


(ZGC A A A AH A A 


GGTTCCACCA 


GAGCCTGTGG 


GTTTGTCCTT 


AGGCTAGAGG 


60 




TY2f2 A' APPTYSft 


TGGAATCAGT 


AGGAGGGACA 


TTGAAGCTCA 


CAGATATATC 


1 90 


f\\*t\\~\3 X iVJVJ X 


<^-rV\j X iut X X C 


TGCAACTGGT CCGTTAGGTG CTGTGGGACG AGCCCTTTCC 


xou 


lAULrl 1L.X XL. 


& PPTfPT RPR 


TCCACTGTCT GCCTGTGTCT 


CAGTTTACAT 


CTTCCTTTTG 




AATATGGAAT 


CTCAAATAGC 


CCAGGCTAGC 


TTCAAATTTG 

i 


TTAAGTAGTT 


GAGGACAACC 


300 


TCAATCTTTC 


TATTCGCTGC 


CTCCACCTCT 


CTCAGTTTAC CTTTTTTTTT 


TTTCTCCTTA 


360 


GGATTAAGTC 


CCGTACAGGC 


CCAGAGTGGT 


AAGCCATAAC 


ACCCCTGGTC 


TTTCTCTCTT 


420 


CCTCTCAAGA 


TTTCCTCAGG 


i 

CTACCCCTTT 


TCCTTCTAGC 


TCTCTCTCTC 


j 

TGCTAACGCC 


480 


GAGCCCTGAT 


TGTTAACCTG 


TGTCTCCCTC 


TTCATCCTTC 


TGAGACAATT 


ACCCAGGATG 


540 


CGAATGTTCC 


TCTGTGAGCC 


CGGGTGTACT 


GGCTGGGATT 


GTGCTGGGCG 


ACTTGGTGCT 


600 


GACTCTGCTC 


ATCGCCCTGG 


CGGTGTACTC 


TCTGGGCCGC 


CTGGTCTCTC 


GAGGCCGAGG 


660 


GACTGCAGAC 


GGGACCCGGA 


AACAGCACAT 


GGCTGAGACT 


GAGTCACCTT 


ATCAGGAGCT 


720 


TCAGGGTCAG 


AGGCCAGAAG 


TATACAGTGA 


CCTCAACACA 


CAGAGGCAGT 


ATTACAGATG 


780 


AACCCACCCT 


ATGCCCACCA 


ACAACCTGAT 


GCCCGGATCC 


ACTCATTCCA 


GACGCTTACT 


840 


CAACAAACCC 


TCCCTGGGAT 


CAGGACTCCC 


GCTGGAATAA 


ATATCCACAG 


AGTGCCCTCT 


900 


GGGAGATATC 


TGACCTTGTA 


CCGTTTCTGT 


CCCCAAATAA 


AAGACGAAGC 


AAAAACAAAA 


960 


A 












961 



(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1890 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
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(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW082 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 150.. 569 

(ix) FEATURE: 

(A) NAME /KEY: misc_feature 

(B) LOCATION: 1381.. 1848 

(D) OTHER INFORMATION: /label* SAC„24469 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

GTCGACCGCG GCCGCGTCGA CGGACTTGCG GTTCCTTCAG 1 CATTCTGCCC GCGCACTGAG 60 

TGGCTGTTTC CTGGAGCTAC TGTGTTCCTT CGGGAGTGCA GCAGATCCCG TTCAGAACCT 120 

ATTACCAAGA GCCCCGAGCA GGACCCACC ATG CAG TGC TTC AAA TTd ATT AAG 173 

Met Gin Cys Phe Lys Phe lie Lys 

1 i 5 

GTC ATG ATG ATC CTC TTC AAT CTA CTC ATC TTT CTC TGT GGT GCA GCC 221 
Val Met Met He Leu Phe Asn Leu Leu He Phe Leu Cys Gly Ala Ala 
10 15 20 i 

CTG TTG GCT GTG GGA ATC TGG GTG TCC GTC GAC GGG ACA TCT TTC CTG 269 
Leu Leu Ala Val Gly He Trp Val Ser Val Asp Gly Thr Ser Phe Leu 
25 30 35 40 

AAG GCC TTC GGA TCA CTA TCA TCG AGT GCC ATG CAG TTT GTC AAC GTG 317 
Lys Ala Phe Gly Ser Leu Ser Ser Ser Ala Met Gin Phe Val Asn Val 

45 50 55 

GGC TAG TTC CTC ATC GCC GCT GGT GCT GTA CTC TTT ATT CTT GGT TTC 365 
Gly Tyr Phe Leu He Ala Ala Gly Ala Val Leu Phe He Leu Gly Phe 

60 65 70 

CTG GGC TGC TAT GGT GCT CAC TCC GAG AAC AAG TGT GTG CTC ATG ATG 413 
Leu Gly Cys Tyr Gly Ala His Ser Glu Asn Lys Cys Val Leu Met Met 
75 80 85 

GTG TGT TGG ACC CAG CTC CCC AGG CCC ACA ACC AAG GGT TCC CCC TTC 461 
Val Cys Trp Thr Gin Leu Pro Arg Pro Thr Thr Lys Gly Ser Pro Phe 
90 95 100 

ATC CCT GAC ACC AGC CTC TGG GAT CCC AAA ACC CTA CTC TGG ATT CTG 509 
lie Pro Asp Thr Ser Leu Trp Asp Pro Lys Thr Leu Leu Trp He Leu 
105 110 115 120 

GAA GCC TCG CTG TGG GGA CCC CCT GCC TCT TAT GGG AGT GTG CAG GAA 557 
Glu Ala Ser Leu Trp Gly Pro Pro Ala Ser Tyr Gly Ser Val Gin Glu 

125 130 135 

GGT GTG TAT CCC TAAAACGTGT TTCCTTTCTC CACTCAGTTC TTTTCCATTC 609 
Gly Val Tyr Pro 

140 

TCCTCATCAT CTTCATTGCG GAGATTGCAG GGGCCGTGGT TGCTTTGGTG TACACCACAA 669 
TGGTGAGGCA CGGGATGGGG CATGGGAGGA AAACTGGAGC AAAGCTCTGA GAGTTGGTGG 729 
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TGGCCTGTGA 


ATGGCTACTT 


CCCCTACCAG 


TCTATTAATA 


CTGGCCTGAC 


CCGTCCAGGC 


789 


TGAACAATTC 


CTGACATTTC 


TGGTGGTGCC 


TGCTATCGAA 


AAGGACTATG 


GTTACCAGAC 


849 


TGAATTCACC 


CAAGTGTGGA 


ACTCTACGAT 


GGAGGGGGTA AGGTGGGCCG 


GGAGGATTTG 


909 


GGAATGGAGA 


GGAGAAAAAA 


ATCAAGGCAC 


CATCAGTTAC 


CAACCCCATT 


TTGTCTGCAG 


969 


CTGCATTGCT 


GTGGCTTCAA 


CAATTACACA 


GATTTTAACT 


CCTCACGTTT 


CGTCAAAGAG 


1029 


AACAAAGTCT 


TTCCTCCGTT 


CTGTTGCGCC 


AACAACACTG 


ACAGTCATAC 


AGTTGAACCA 


1089 


TGCACCGAGG 


ACAAGGCCAA 


AAGTATGAAC 


GTACAGGTAT 


GTGCTGGTGT 


GGAATTAGCT 


1149 


TTTTCGTTTG 


TGATCCGGTT 


TCTCTGTGTA 


GCTGTGGATG 

i 


TCCTGGAACT 


CGATATGTAG 


1209 


ACCAGGCTGG 


GCTTGAACAG 


AGATCGACCT 


GCCTCTGCCT 


CGTGAGTGCT 


GGGGTTAAAG 


1269 


GCCACCATCA 


CCACCCAGCG 


CCAGTGTAAT 


TGCTTTAATG 


GCTTTACAGT 


AAGGAGTAAA 


1329 


GGGGTTGGAG 


GTAGGGCACT 


GCCAGCTATA 


AATACTTTCT 


TCTCTCTCCC 


AAGGGTTGTT 


1389 


TCAAACAGAT 


TCTGCAGAAA 


ATCAGAACCA 


ATGCAGTTAC 


CGTGGGTGGT 


GTGGCAGTTG 


1449 


GAGTTGCGGC 


CCTAGAGCTG 


GCTGCCATGG 


TGGTATCCAT 


GTATCTATAC 

i 


TGCAATCTGA 


1509 


AGTAAGACTG 


CTTCTTCCTC 


CTGACTTGTC 


GTCACATGGA 


AACCAGGAAG 


AGGCACTTGG 


1569 


CACAGCTAAG 


CAGCAGCAAT 


GGGTGGAGAT 


GAAACGTCAT 


TTGGGCTGGA 


GTGGATCTGC 


1629 


CTTTGCTGCT 


CTGGACTTCG 


GGCTAATTAG 


GGACCACCGC 


TCGGGCTGTG 


TCTGTCTTGT 


1689 


TCTTCCACTA 


CTGGGGCGAG 


GGGGGCATTG 


GTCCCAGTCT 


CTGAGGTAGC 


CATGTCTGCC 


1749 


CATCTCCTGA 


GATTCTTGAT 


GATCCCAGCA 


CTCCACTGGT 


GGATGAAGGC 


ACTTATAGCC 


1809 


TGGGCATATT 


CTAGCACTGC 


AAAGTCATAG 


AAGGAATGTA 


CAGAAGGTGT 


TTTGGAACTT 


1669 


ATAAACCAAT 


GAAAATAAAA 


A 








1890 



(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH s 140 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Met Gin Cye Phe Lys Phe lie Lys Val Met Met lie Leu Phe Asn Leu 
15 10 15 

Leu lie Phe Leu Cys Gly Ala Ala Leu Leu Ala Val Gly lie Trp Val 

20 25 30 

Ser Val Asp Gly Thr Ser Phe Leu Lys Ala Phe Gly Ser Leu Ser Ser 
35 40 45 

Ser Ala Met Gin Phe Val Asn Val Gly Tyr Phe Leu lie Ala Ala Gly 
50 55 60 
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Ala Val Leu Phe lie Leu Gly Phe Leu Gly Cys Tyr Gly Ala His Ser 
65 70 75 80 

Glu Asn Lys Cys Val Leu Met Met Val Cys Trp Thr Gin Leu Pro Arg 

85 90 95 

Pro Thr Thr Lys Gly Ser Pro Phe lie Pro Asp Thr Ser Leu Trp Asp 

100 105 110 

Pro Lys Thr Leu Leu Trp He Leu Glu Ala Ser Leu Trp Gly Pro Pro 
115 120 125 

Ala Ser Tyr Gly Ser Val Gin Glu Gly Val Tyr Pro 
130 135 140 

(2) INFORMATION ?OR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2181 base pairs 

(B) TYPE:, nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
<iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW083 

(ix) FEATURE: 

(A) NAME /KEY: misc_feature 

(B) LOCATION: complement (1950.. 2181) 
(D) OTHER INFORMATION: /label- SAC_24528 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 



GTCGACCCAC 


AATCGCCCGA 


GCACTTGGAA 


TCACAGCAGA 


AACGACCCAA 


TCCTGAGCTC 


60 


CGTAGAAATG 


TGACCATCAA 


AACTGAACTC 


CCACTAGGCG 


CACGGCGAAA 


GATGAAGCCA 


120 


CTGCTCCCAC 


GGGTTAGCTC 


ATACCTGGTG 


CCCATCCAGT 


TCCCGGTGAA 


CCAGTCCCTG 


180 


GTGTTACAGC 


CCTCGGTGAA 


GGTTCCCTTG 


CCTCTGGCAG 


CATCTCTTAT 


GAGCTCAGAG 


240 


CTTGCCCGTC 


ATAGCAAGCG 


AGTCCGCATT 


GCACCCAAGG 


TGCTGCTATC 


CAACGAAGGG 


300 


ATAGCCCCAC 


TTCCTGCCAC 


AGAACCCATG 


AAGGAGGAGA 


AACCCCTGCT 


TGGAGAAGGG 


360 


CTATTGCCTT 


TGCTTCCTAT 


TCAGTCCATT 


AAGGAAGAAG 


TAATTCAGCC 


TGGGGAGGAC 


420 


ATACCACACT 


TAGAGAGGCC 


TATCAAAGTG 


GAGAGCCCTC 


CCTTGGAAGA 


GTGGCCCTCT 


480 


CCGTGTGCAT 


CAGTGAAAGA 


GGAACTGTCC 


AACTCCTGGG 


AAGATTCTTC 


CTGCTCTCCT 


540 


ACCCCAAAGC 


CCAAGAAGTC 


CTATTGTGGG 


CTTAAGTCCC 


CAACACGGTG 


TGTCTCAGAA 


600 


ATGCTGGTGA 


CAAAGCGGAG 


AGAGAAGAGA 


GAGGTGAGCC 


GATCTCGGAG 


GAAGCAGCAC 


660 
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CTTCAGCCAC 


CCTGTCTAGA 


TGAGCCTGAA 


CTCTTCTTCT 


CAGAGGACTC 


CAGCACATTT 


720 


CGGCCAGCCA 


TGGAGATCCT 


GGCAGAGTCT 


TCAGAGCCTG 


CACCACAGCT CAGCTGCCCT 


780 


CAGGAGGAGG 


GAGGGCCCTT 


CAAGACCCCC 


ATCAAGGAGA CATTGCCTGT 


CTCCTCCACT 


840 


CCTAGCAAGT 


CTGTGCTCTC 


TAGAGACCCT 


GAGTCCTGGA 


GGCTCACACC 


CCCAGCCAAA 


900 


GTTGGGGGGT 


TAGATTTCAG 


CCCAGTACGA 


ACCCCCCAGG 


GTGCCTTTGG 


CCCTCTGCCT 


960 


GACTCGCTGG 


GGCTTATGGA 


GCTGAATACC 


ACACCTCTGA 


AAAGTGTTCC 


CCTCTTCGAC 


1020 


TCACCCCGGG 


AGCTCCTTAA 


CTCAGAAGCC 


TTTGACCTTG 


CCTCTGATCC 


CTTTAGCAGT 


1080 


TCTCCACCAC 


CACATTTGGA 


AGCCAAGCCA 


GGCTCCCCCG 


AGCTGCAGGT 


CCCCAGCCTT 


1140 


TCAGCCAACC GTTCTCTCAC AGAAGGCCTT GTCCTGGACA CAATGAATGA TAGCCTCAGC 


1200 


AAGATCCTTC 


TAGACATCAG 


TTTCCCTGGC 


CTGGAGGAGG 


ACCCTCTGGG 


CCCTGACAAC 


1260 


ATCAACTGGT 


CTCAGTTCAT 


CCCTGAGCTG 


CGATAGAOGC 


AGGGTCTTAC 


CCTTGCCACT 


1320 


CAAGCCACCA 


GTTATCCTGG 


CACTTGTGTG 


GCTGGATAGT 


GCAAGGCTCA 


GTGTACCCCA 


1380 


AACCGTCTGA 


GGGAGCTAGC 


AGGCAAGGGC 


TGAGCGGTGC 


CCTTTGACCT 


AATTATGCCA 


1440 


AGGTAAAAGC 


CACGTCTAAG 


CCACTGCTGG 


GACCTATGCA AGCAATAGGA 


TCTCCCAGAG 


1500 


TCCTCCACTC 


CCTGCTGGCA 


AGTGAAGTGG 


GTGTGACAGA 


GCCGTGAGGA 


CCAGGAAATG 


1560 


CCCACCCATT 


AGTCACCTGC 


TGCTCCTGGC 


AGGATAACCC 


TTGTAAATGG 


TGTCAGTTCC 


1620 


CCAAGTTGTC 


CTGTAATTAT 


AAATGTAGCC 


ATATTCCCTT 


AGCTCTCATT 


ATCCAGAGAC 


1680 


TGCCAGGATG 


GGTAGGGTGA 


CAAGGGGTTG 


CATTAGCTTC 


TGCTTGTGGC 


CTTTGGGGGC 


1740 


AGGACCTGCA 


GTTCAGCCTC 


TTCACACTGT 


GGGTTCTGCT 


GTAGGCTTCT 


AGACACACAG 


1800 


GTGTCCTTGC 


CAGGACCCCA 


CTTACTGCCC 


TTTCCTCACA 


GCTCCCCCTG 


GTTCTAAGCC 


1860 


AGTGGTACTG 


CATGAAGAAA 


TCCTGCGGCA 


AAGCCTATTG 


GCTCTGGGTG 


TGTGGGGACG 


1920 


GGTGTGCCTG 


AAGCAAAAGC 


ATGGGTACTC 


ACGTGAGTCC 


TTTAGGTGTT 


TCTCTGATCG 


1980 


TGTTCCCAAT 


CATGCCAGGG 


AGTCTAGCAT 


TGAGAACTCA 


GGCTGAGGCC 


TGAGGAGGAG 


2040 


GAGGAAGTGA 


CCACTGACTT 


GCCTGGCTTC 


CTTAGCTTGC 


ACCTGAGTTT 


TGCAAAAAGC 


2100 


CACCCTAGAC 


CCCACTCTAC 


AAGCTAGCAC 


AAGAACACTA 


CTGTAACTAC 


CTACTGAATA 


2160 


AAGCCCAGGT 


GGCCTGAAAA 


A 








2181 


(2) INFORMATION FOR SEQ ID NO: 75 


• 
• 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3336 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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{iii} HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW084 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: complement (163. .391) 

(D) OTHER INFORMATION: /label= SAC„24854 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:75: 



TTTTTCCACC 


AAGAGGGAGT 


CCTTTAATAG 


ATTGCACAGA 

i 


CAGACACAAC 


AGGCCAGCAC 


60 


CCTGCTTGCA 


GGCCCAGGCG 


CTTTGCCAGC 


AGATGCAATG 


GCCTCCCCCA 


CCTGGGTGTG 


120 


GGCCGCAGGG 


TGGAGATGAG 


GTGGTTTAAA 


AAACACAGCT 


GTACTAATTC 


TTCACTTTCA 


180 


CAGAAAAGAG 


AAACTTGGGG 


GAGCAGCTCA 


CAGGAGAATT 


TACATAGAGC 


TGAGTCTGAA 


240 


TGTCCACCCC 


TAGAGAAGTT 


GAACCTCCCC 


CTGAGCTGTC 


TGTGCTCAGC 


AGGGGGCAGA 


300 


CAGCACATGG 


AGGGCGCAGG 


CCAGTCCACA 


GAGGAGCGCT 


GACTTAAACA 


GCTGTCCTCC 


360 


CTTCCCAGNT 


GTTAGNATTT 


CTACACNATG 


TACATTGTTG 


i 

TGCTTTATTG 


CTCCCACCCC 


420 


AGCCTCTGCC 


TGAAGCCCAG 


GGTGGAGGGA 


GGAACACCTG 


AACCAGATTC 


AGCTGGGTGG 


480 


GAGCACCAAG 


CCCCCAATGG 


CTGGTGCTCT 


GTAAGNTTCT 


GAGGCTGCCA 


GGCTCNNGNC 


540 


ACTCGGAGCT 


GCAGACCAAG 


GGTACAGAGC 


CAGCCGACCA GCCAGCTCTT 


CTGAGGAGCA 


600 


GCAGAAACAG 


AAGGATCAGT 


GCTGGACAAG 


ATGCAGGCTG 


GGTAGGAGCA 


GGCAGGGTGG 


660 


AAGGACGAGT 


GCAAGGCCAT 


GAGTCTGGGG 


CAGAGGGTTG 


TGCACATGGG 


GCTGTGGGGG 


720 


CCTGCAGGGC 


ACCTCACGAG 


ACAGAACCAT 


TCTGTAGCAG 


AGCTTGACGG 


AGGAAGGGTG 


780 


TGAGGACCAC 


TGGCCTGGGG 




CACAGAGCTG 


CTTCCAGTTC 


ATCAGGGGCT 




GGCAGTGGCA 


GGCCCCAGCT 


CGGGGCGTGT 


GCATTGGGAG 


ATTCAGGGAG 


AACCNGCAGG 


900 


CAGCCTGTGG 


GGCTGGGTGA 


GGAGGGAAGG 


CTGTGGTGGT 


ATCTTTGAGT 


NTGTCTTGGC 


960 


CGGGCCAGGG 


GTTAACACAG 


GATCAGTTGT 


TCTCAAAGAG 


TGAGAGCAGA 


TCATCACTGC 


1020 


TGTTTGTGGG 


GAGGTCAGGT 


GGACCCAGGT 


AGGAGAGTAG 


TTCATCAGGA 


TTGGTCAGTT 


1080 


CTGGGAGCAG 


ANCNAGAGCT 


GGTTCTGAGG 


CCTCCCCTGC 


CCCAGTCATG 


CTGGGAGGCC 


1140 


CCATCATGCC 


CGTGGCAGNA 


TTGAAAGCCA 


GTTCACTGAT GGGCCCTGTG 


TTTGGTTGGC 


1200 


CCAGGGGCTG 


CCTGGATGCA 


GGGGAAGGGT 


TTGGATGGTG 


TAGCTGGGTG 


CCTGGGGTGG 


1260 


GCCCGAGGTT 


GGGAGTGTGC 


AGTCCTGGTG 


GTCCAGGATT 


ATGGGCTGGG 


TCCCAGGTGA 


1320 


CCCTGCCTGG 


TGCCCATCCT 


GGCCTGGGAG 


GCAAGGAGCA GACTTGNNNN GAGTCAGGAG 


1380 


GCTGCTGGGA ATGTCAGACT 


GGTAGGAGAT 


TGGGGGTGGC 


CCCTGGGGTG 


AACTCAGCAA 


1440 


GGTTTGGGGT 


GGCGGGTGTA 


GCAGATGGGA 


AAGATTCTGG 


GAAGGTTCCA GGCCCCCAGG 


1500 
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AAGTTGGAAC 


CCTGGCTGGG 


NTAGTCGNTG 


GGGGTAGGGN 


CCNAAGGGGC 


TGCAATGGGG 


1560 


CAAANGGGCA 


GCGCCAGGGC 


CCAGGGCCGC 


AATCATCTCC 


ATCACACTGG 


GCATGAGCAC 


1620 


GTGGGCAGGG 


CTCACAGTAC 


GGCAGCGCTT 


CAGCACTGGG 


CCCGTCCGGC 


TCCTCCTTGA 


1680 


TGTGTAGGTC 


AGGCTTCACA 


GGCACTGGTT 


TCCAGCTGCA 


TGTGGGGTCG 


ATGGTGATCT 


1740 


CCTCGTAGTC 


TGAGTTCTGA 


ATATAAATCA GGATGCCTAG 


CATATACTGA 


TCCACCTCCA 


1800 


GGCCCTCCAG 


CAATGCCGTC 


TTGTTGCACA 


CCGGGACACC 


TCCAGGTCCC 


CCGCTCACAG 


1860 


TTAAGCTGTA AGTATGACTC 


CAGGTCAAAG 


CACTGTATGT 


GGCGACAGTC 


GTGACCACGA 


1920 


GCTGGGAGCT GGATCCTGCG 


GAAGGTGATG 


gggcacttca! 


GGGACACCTT 


GATAGCCGTC 


1980 


TGCTCCACCC 


CATCCTCTCC 


ATTGGGCCCA 


GGGGTGCCAG 


GGATGGTGCC 


GCTACTGAAG 


2040 


TTCCGCTTTA 


TCTTGGTGAT 


GCAGTGCTCA 


GCTGGCAAGN 


GGCGCTTCTT 


GAGGAGACCC 


2100 


TGCAGCACAG 


AGCGGACAGA 


CGGACGGTGC 


ACCAGCTGCA 


jGCACGAAGAG 


GTGGGAGCAG 


2160 


CAGCAGGCGG 


TGACGGTGAT 


CTGGATGGTG 


TTGCGACCCG 


GCTGGCACAC 


ATGCTTCAGG 


2220 


TAGAGGGGCT 


TGTGCGAGGT 


CTTGTTGTCT 


CCACGCTCGA 


TGCTGAGGGG 


TGTGNCATTG 


2280 


ACGCTGACCT 


GCACCGNNGC 


CGGCCANTTG 


GTGGTTCATC 


( TGCCGGTCCT 


CGTGGTGGTA 


2340 


GCACTTGAAC 


TGCGAACTGA 


GCTCTAAGTC 


AGGTCTCAGC 


ATCAGGGTTT 


TATACACAGA 


2400 


GTCTCGNAGC 


TGGAAGACAT 


GGTTGCTCAC 


AGCCAGGTTG 


TGCTGCAGGC 


GGAAGGGCTC 


2460 


CAGGACCACC 


CCATCCCGAA 


CTGGGAAGGT 


CAGGCGCAGC 


TCGTCACAGG 


GGACGCTTCC 


2520 


AGAGGGTGAT 


GGGTGCAAGG 


AGGCTGAGGC 


CTGGCTTGAG 


GTCAGGCAGG 


AAAGGAGACT 


2580 


TGACCTCCTG 


GCTTGGGGAC 


ACGTAGGGAG 


CACTGCTGCT 


AGGCGTCATG 


GGTGGTGTGG 


2640 


GATTCCCCGG 


CAGTGGGGAG 


CTGGGGTAAG 


CAGGGATGGA 


GCGGGAAGGC 


CCACTCAAGC 


2700 


CAGGCTGGCT 


GTAGCTGATG 


CTGCCCCCGT 


TGAANCTGGC 


GCCCTGCCCG 


TTGAACTGCT 


2760 


CTGTGGGCTT 


GTAATGCAGN 


CCTGTGGGGC 


CAGGGGCAGA 


CAGGGACTGG 


GCCATGCCCT 


2820 


GCTGCAGGGG 


GAGTCTGTGT 


CCTGCGTAGG 


AGGAGGCAGG 


GCCAGGGGGC 


TGCCCGGGAC 


2880 


CAGGAGCATA 


CTGGGCAGTG 


CTGGCTGCAT 


ACTGGCCNNC 


TTGCAGATAT 


TGNTGNCNAG 


2940 


GATACACCTC 


TGAGTAGGTC 


TCTCTTGACC 


CNCTGTCGAG 


GCAGCGGCTG 


CCCCTGGGGA 


3000 


GGCCCAGGGT 


AGCCATGCTG 


AGGCAGTCGC 


TGTCCTGCAT ACAAGGGTGT 


CATGCCTGCT 


3060 


GCCCGTGTAG 


GATTCATGGC 


CATGGAGGAA 


AGGCCTGAGG 


GGCCCATCAT 


TCCTCCCATG 


3120 


CCTGAAGGGT 


TCATGCCAGG 


GGGTACACTG 


GGCCCTCGAG 


GACCTCCATG 


CTGCAGAAAC 


3180 


TGGCTGTTAA AAGATTGCCC 


AGTCCCCATC 


GCTCCATATT 


GGCTTAGCTT 


CCTGGCTCTN 


3240 


CTTCTCTTGC AGGGNNGCCA 


CAGTAGCTGT 


GGCTGTGGCT 


GTGGCCGTNC 


TGCAGCAGCA 


3300 


GCCATGGCAG 


CTGCAGCTCT 


GCTTGTGTGA 


AGTCAG 






3336 
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(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1494 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW086' 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 9.. 686 

(ix) FEATURE: 

(A) NAME /KEY: misc_feature 

(B) LOCATION: complement (612.. 909) 

(D) OTHER INFORMATION: /label= SAC„24336 

i 

1 ! 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

GTCGACCC GCT GCG GTC TAT CTG GGC AGC ACC ATG CTC GCG CTC CGA AGC 50 
Ala Ala Val Tyr Leu Gly Ser Thr Met Leu Ala Leu Arg Ser 
1 5 10 

GGC CTG AGG ACG GCG CTG GCC CCG CGG GCT CTG ACG CCT CAG GTA TGT 98 
Gly Leu Arg Thr Ala Leu Ala Pro Arg Ala Leu Thr Pro Gin Val Cys 
15 20 25 30 

TCA CCT TTT GCT ACA GGC CCC AGG CAA AGC GAT GGG ACA TTG TAT GAA 146 
Ser Pro Phe Ala Thr Gly Pro Arg Gin Ser Asp Gly Thr Leu Tyr Glu 

35 40 45 

TTC CGC ACC TAC TCT CTG AAG CCC TCA AAG ACA AAT GCG TTC CTG CAG 194 
Phe Arg Thr Tyr Ser Leu Lys Pro Ser Lys Thr Asn Ala Phe Leu Gin 

50 55 60 

AAT TTT CAG AAA TAC GTT CAC CTT CGG ACA GCT CAC TCT GAG ATG ATT 242 
Asn Phe Gin Lys Tyr Val His Leu Arg Thr Ala His Ser Glu Met He 
65 70 75 

GGA TAC TGG ACC GTA GAA TTT GGA GGC AAA ATA AAC AGA GTG TTC CAT 290 
Gly Tyr Trp Thr Val Glu Phe Gly Gly Lys He Asn Arg Val Phe His 
80 85 90 

ATT TGG AAG TAT GAT AAT TTT GCT CAT CGA ACT GCC GTC CGC AAA GCC 338 
He Trp Lys Tyr Asp Asn Phe Ala His Arg Thr Ala Val Arg Lys Ala 
95 100 105 110 

TTG GCC AAG GAT AAG GAA TGG CAA GAA CAA TTC CTC ATT TCA AAT TTG 386 
Leu Ala Lys Asp Lys Glu Trp Gin Glu Gin Phe Leu He Ser Asn Leu 

115 120 125 

GCT TTC ATG GAT GAA CAA GAG GTT GAG ATT ACC TAC TTG GTA CCC TGG 434 
Ala Phe Met Asp Glu Gin Glu Val Glu He Thr Tyr Leu Val Pro Trp 

130 135 140 
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TGT AAA ATA AGA ACA CCT CCC AAG GAA GGA GTC TAT GAA CTG GCT ACC 482 
Cys Lys He Arg Thr Pro Pro Lys Glu Gly Val Tyr Glu Leu Ala Thr 
145 150 155 

TTT CAG ATG AAG CCT GGT GGC CCA GCT CTA TGG GGT GAT GCA TTC AAA 530 
Phe Gin Met Lys Pro Gly Gly Pro Ala Leu Trp Gly Asp Ala Phe Lys 
160 165 170 

AGG GCA ATA AAC GCC CAT GTT GAG CTT GGC TAC TCT ACA CTA GTT GGT 578 
Arg Ala He Asn Ala His Val Glu Leu Gly Tyr Ser Thr Leu Val Gly 
175 180 185 190 

GTT TTC CAC ACT GAA TAT GGA GCC CTC AAC AGA GTT CGG GAA AGT GTC 626 
Val Phe His Thr Glu Tyr Gly Ala Leu Asn Arg Val Arg Glu Ser Val 

195 200 205 

AAT TAC CTC GAG TCT CAG CAG AAT ATG TTC CTG ATT CCA ACC TCA TTT 674 
Asn Tyr Leu Glu Ser Gin Gin Asn Met Phe Leu He Pro Thr Ser Phe 

210 215 220 

TCA CCA TTG AAG TAGTTTCCTA CCGAGCGCTG AGCATTCCCA TCACGGAGGT 726 
Ser Pro Leu Lys 
225 



GAGATGCGTT 


TCCCCCAGAC 


AATCGCCCTG 


TTCATTGTCT 


GAAGGAGGTG 


CTGAGCTACC 


786 


GCACTGCCCT 


CTGCATGTCT 


GAGACTTCAT 


GTATCCTTTA, 


CTACCAGTCG 


CTCAGGAAAT 


846 


GTTTCTGTCC 


CTTTTCATGT 


CTACTGGATG 


TCAGAAAGTG 


GTGTTATTAC 


TCCATGCCCT 


906 


GGTACTTCAT 


TGTTCTTCTG 


TGGTTTTATG 


ATTAAGTCTT 


GTCCTCAAAA 


TATGATTTTG 


966 


AAATTTTTTC 


CCTCAAAATA 


TTTGGAAATT 


AAATAAATAC 


AATATCTGCC 


TTTTTATTGC 


1026 


GTTCATTTTC 


CCTGAAATTT 


TTAACTGTTT 


TTACCAGTAA 


TGTTTACTTT 


GTGATTTCAA 


1086 


ATGGGATTGA 


TAGAAGTCAG 


AGAAAGAACT 


GGAAGAGCTT 


GAAGGGGCTC 


GAGACCCCAT 


1146 


ATGTACAACA 


ATGCCAAGCA 


ACCAGAGCTT 


CCAGGGACTA 


AGCCACTACC 


TAAAGACTAT 


1206 


ACATGGACTG 


ACCCTGGACT 


CTGACCTCAT 


AGGTAGCAAT 


GAATATCCTA 


GTAAGAGCAC 


1266 


CAGTGGAAGG 


GGAAGCCCTG 


GGTCCTGCTA 


AGACTGAACC 


CCAGTGAACG 


TGATTGTTGT 


1326 


GGGGAGAGTG 


GTAATGGGGG 


GAGGATGGGG 


AGGGGAAGCC 


CATATAGAAG 


GGGAGGGAGA 


1386 


GGGGTTAGGG 


GGATGTCGGC 


CCAGAAACCA 


GGAAGGGGAA 


TAACAATTGA 


AATGTAAATA 


1446 


AGAAATACTC 


AAGTTAATAA 


AGATGGAGAA 


AAGAAAAAGA 


AAGTCGAC 




1494 



(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 226 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:77: 
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Ala Ala Val Tyr Leu Gly Ser Thr Met Leu Ala Leu Arg Ser Gly Leu 
1 5 10 15 

Arg Thr Ala Leu Ala Pro Arg Ala Leu Thr Pro Gin Val Cys Ser Pro 

20 25 30 

Phe Ala Thr Gly Pro Arg Gin Ser Asp Gly Thr Leu Tyr Glu Phe Arg 
35 40 45 

Thr Tyr Ser Leu Lys Pro Ser Lys Thr Asn Ala Phe Leu Gin Asn Phe 
50 55 60 

Gin Lys Tyr Val His Leu Arg Thr Ala His Ser Glu Met He Gly Tyr 
65 70 75 80 

Trp Thr Val Glu Phe Gly Gly Lys He Asn Arg Val Phe His lie Trp 

85 90 95 

Lys Tyr Asp Asri Phe Ala His Arg Thr Ala Val Arg Lys Ala Leu Ala 

100 105 110 

\ 

Lys Asp Lys Glu Trp Gin Glu Gin Phe Leu lie Ser Asn Leu Ala Phe 
115 120 125 

Met Asp Glu Gin Glu Val Glu He Thr Tyr Leu Val Pro Trp Cys Lys 
130 1 135 1 140 

He Arg Thr Pro Pro Lys Glu Gly Val Tyr Glu Leu Ala Thr Phe Gin 
145 150 155 160 

Met Lys Pro Gly Gly Pro Ala Leu Trp Gly Asp Ala Phe Lys Arg Ala 

165 170 175 

He Asn Ala His Val Glu Leu Gly Tyr Ser Thr Leu Val Gly Val Phe 

180 185 190 

His Thr Glu Tyr Gly Ala Leu Asn Arg Val Arg Glu Ser Val Asn Tyr 
195 200 205 

Leu Glu Ser Gin Gin Asn Met Phe Leu He Pro Thr Ser Phe Ser Pro 
210 215 220 

Leu Lys 
225 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1082 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
{B) CLONE: HW089 

(ix) FEATURE: 

(A) NAME /KEY: CDS 
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(B) LOCATION: 8. .1006 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 961.. 1076, 

(D) OTHER INFORMATION: /label= SACJ24461 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

GTCGACC AAG CAA CAG CAG GGA AGT GAA TTT CAG ACA CTG CTG GAG GCT 49 
Lys Gin Gin Gin Gly Ser Glu Phe Gin Thr Leu Leu Glu Ala 
15 10 

GAG GAG ACC AAA GCC GCC TCT GAC GCT TCA AAT AAA AAG CTT TGG TCC 97 
Glu Glu Thr Lys Ala Ala Ser Asp Ala Ser Asn Lys Lys Leu Trp Ser 
15 , 20 25 30 

CTC AGA ACT CTA CCC TTC CAG CCC TGC TCT GAA GCC TCA GAC GTG AGA 145 
Leu Arg Thr Leu Pro Phe Gin Pro Cys Ser Glu Ala Ser Asp Val Arg 

35, 40 45 

ACC ATG TCT GTG ACC CTG CAA GCT GTC CTC TAC AGT CTT CAG GAG GAG 193 
Thr Met Ser Val Thr Leu Gin Ala Val Leu Tyr Ser Leu Gin Glu Glu 

50 1 55 60 

CAG TCC AGG CTC AAG ACG AGG CTG CAG GAA CTA CAG CAG CTG AAA AGG 241 
Gin Ser Arg Leu Lys Thr Arg Leu Gin Glu Leu Gin Gin Leu Lys Arg 
65 70 75 

GAG CGC ATA GAT GTT CCC AGA GAC AAG ATC CCA TTC CCG GTG CCT GAA 289 
Glu Arg lie Asp Val Pro Arg Asp Lys lie Pro Phe Pro Val Pro Glu 
80 85 90 

GTC CCC GTG GTA TTC CAA GGT CTA ACT AAG CAG AGC GGC CAA GCA CCC 337 
Val Pro Val Val Phe Gin Gly Leu Thr Lys Gin Ser Gly Gin Ala Pro 
95 100 105 110 

AAG TCT CTA GTT TCT AAC TTG AAG ATC TGC TGC CCT CTG CCT GGA GGT 385 

Lys Ser Leu Val Ser Asn Leu Lys He Cys Cys Pro Leu Pro Gly Gly 

115 120 125 

TCT GCT CTG ATC ACC TTT GAT GAC CCC AAA GTG GTT GAG CGG TTG CTA 433 
Ser Ala Leu He Thr Phe Asp Asp Pro Lys Val Val Glu Arg Leu Leu 

130 135 140 

CAA CAA AAG GAA CAT AAA GTT GAC ATC GAG GAC TGC CGG CTG CGG GTG 481 
Gin Gin Lys Glu His Lys Val Asp He Glu Asp Cys Arg Leu Arg Val 
145 150 155 

CAG ATC CAG CCC TTG GAG CTG CCT ATG GTG ACC AAC ATC CAG GTG TCC 529 
Gin He Gin Pro Leu Glu Leu Pro Met Val Thr Asn He Gin Val Ser 
160 165 170 

AGC CAG CCG GAT AGC CAC AGG GTG CTA GTC AGT GGC TTT CCT ACT GGA 577 
Ser Gin Pro Asp Ser His Arg Val Leu Val Ser Gly Phe Pro Thr Gly 
175 180 185 190 

CTT AGG CTG AGT GAG GAG GAG CTA CTG GAC AAG TTA GAG ATC TTC TTC 625 
Leu Arg Leu Ser Glu Glu Glu Leu Leu Asp Lys Leu Glu He Phe Phe 

195 200 205 

GGC AAG GCC AAG AAT GGA GGT GGG GAT GTG GAG ACC CGG GAG ATG CTG 673 
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Gly Lys Ala Lys Asn Gly Gly Gly Asp Val Glu Thr Arg Glu Met Leu 

210 215 220 

CAA GGG ACT GTC ATG CTG GGA TTT GCT AAT GAA GAA GTG GCC CAG CAC 721 
Gin Gly Thr Val Met Leu Gly Phe Ala Asn Glu Glu Val Ala Gin His 
225 230 235 

TTA TGC CAG ATT GGC CAG TTC AGA GTC CCA CTG GGT CGG CAG CAG GTC 769 
Leu Cys Gin lie Gly Gin Phe Arg Val Pro Leu Gly Arg Gin Gin Val 
240 245 250 

CTT TTG AGG GTC TCT CCC TAT GTG AGT GGG GAG ATC CAG GAA GCT GAG 817 
Leu Leu Arg Val Ser Pro Tyr Val Ser Gly Glu lie Gin Glu Ala Glu 
255 260 265 270 

ATC AAA TTC CAG CAA GCA CCT CAT TCA GTG CTG GTG ACG AAT ATT CCC 865 
He Lys Phe Gin Gin Ala Pro His Ser Val Leu Val Thr Asn He Pro 

275 280 ' 285 

GAC GTC CTG GAT GTG CAG GAA CTG CAT GAC ATT CTG GAG ATC CAC TTC 913 
Asp Val Leu Asp Val Gin Glu Leu His Asp He Leu Glu He His Phe 

290 295 , 300 

CAG AAG CCT ACC CGT GGG GGC GGA GAG GTG GAG GCC CTG GCA GTT GTA 961 
Gin Lys Pro Thr Arg Gly Gly Gly Glu Val Glu Ala Leu Ala Val Val 
305 310 ,315 

CCT GTG GGG CAG CAG GGC CTG GCT GTC TTT ACT TCT GAG TCA AGC 1006 
Pro Val Gly Gin Gin Gly Leu Ala Val Phe Thr Ser Glu Ser Ser 
320 325 ' 330 

i • 

TAGGGCTGCA TGTCCACCTG CTGGGCATGG GACTGCTCAT TGCCTAATGC TACCAAGAAC 1066 
AGTGAAAGGC GTCGAC 1082 



(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 333 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:79: 

Lys Gin Gin Gin Gly Ser Glu Phe Gin Thr Leu Leu Glu Ala Glu Glu 
15 10 15 

Thr Lys Ala Ala Ser Asp Ala Ser Asn Lys Lys Leu Trp Ser Leu Arg 

20 25 30 

Thr Leu Pro Phe Gin Pro Cys Ser Glu Ala Ser Asp Val Arg Thr Met 
35 40 45 

Ser Val Thr Leu Gin Ala Val Leu Tyr Ser Leu Gin Glu Glu Gin Ser 
50 55 60 

Arg Leu Lys Thr Arg Leu Gin Glu Leu Gin Gin Leu Lys Arg Glu Arg 
65 70 75 80 

He Asp Val Pro Arg Asp Lys He Pro Phe Pro Val Pro Glu Val Pro 
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U5 * L vs Val Val Glu Arg Leu Leu Gin Gin 

U eu He Thr Phe Asp Asp Pro Lys u0 

U0 m tie Glu Asp Cys Arg Leu Arg Val Gin II. 

Ly8 Glu His Lys Val Asp lie Glu Asp ^ 

145 T . rln val Ser Ser Gin 

«. « - as ™ - v " ™ S ^ ° 
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m,, He Phe Phe Gly Lys 
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195 .riu Met Leu Gin Gly 

ua W5 « «y «v S - - - « SS 

210 , » n » pin His Leu Cys 

, Ma Asn Glu Glu Val Ala Gin His 

Thr Val Met Leu Gly Phe Ala Asn ^ 

225 1 01n nn Val Leu Leu 

, „i Phe Arg Val Pro Leu Gly Arg Gin Gin Va ^ 
Gin He Gly Gin Phe Arg va KQ 

val Ser Gly Glu He Gin Glu Ala Glu He Lys 
Arg val Ser Pro Tyr Val Ser Gly ^ 270 

260 * TiD Pro Asp Val 

, he «, «. « - - ffi - val M " S! 

275 ti« His Phe Gin Lys 

«•„ &an He Leu Glu He His 
> m val Gin Glu Leu His Asp lie u 
Leu Asp vai »iu 2Q6 

290 , Ma val Val Pro Val 

rlv Glv Gly Glu Val Glu Ala Leu Ala Val V ^ 
Pro Thr Arg Gly Gly <*iy 315 

305 , Ma Val Phe Thr Ser Glu Ser Ser 

Gly Gin Gin Gly Leu Ala Val Pn ^ 

(2 ) INFORMATION FOR SEQ ID NO: 80: 

B TYPE: nucleic acid 
C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

IMMEDIATE SOURCE: 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION; 335. .937 

(ix) FEATURE: 

(A) NAME /KEY : roisc_£eature 

(B) LOCATION: 1355.. 1434 

(D) OTHER INFORMATION: /label= SAC_24197 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 



GTCGACCTTT 


AAGCTTACCA 


TCAGAACAAC AAATCAAAAT GTAAACTTAA AATATAACCA 


60 


AAAGAGGGAC 


AGCTCTTTAG 


GAAAAGGAAA AAACCTTAAA TAGTGAATAA ACAACTACAA 


120 


CCACTTAACC 


ATTGTAGGCT 


TAAAAGCAGC CATCAATAAA GAAAGCGTTC AAGCTCAACA 


180 


TACACACACA 


CACACACACA 


CACACACGCT CCCTCCCTCC GGCGCTCTAC TCCGCGCCCG 


240 


CCCGCCCGCT 


CGCCCGCCCG 


CCCGCCTCGC ACGCACCGGC CTCGGCTCCC CGCGCCCGCC 


300 


GCCGAGCGAA 


GCTGCTCTGG 


GATCGGAGCA GCTC ATG GAG AAA GTG CCG GGC 

Met Glu Lys Val Pro Gly 
1 5 


352 



GAC ATG GAG ATA GAG CGC AGG GAG AGG AAC GAG GAG CTG TCC GAG GCG 400 
Asp Met Glu lie Glu Arg Arg Glu Arg Asn Glu Glu Leu Ser Glu Ala 

10 15 20 

GAG AGG AAG GCG GTT CAG GCT ACG TGG GCC CGG CTG TAT GCC AAC TGC 448 
Glu Arg Lys Ala Val Gin Ala Thr Tip Ala Arg Leu Tyr Ala Asn Cys 
25 30 35 

GAG GAC GTG GGG GTG GCC ATC CTG GTG AGG TTC TTT GTG AAC TTC CCG 496 
Glu Asp Val Gly Val Ala lie Leu Val Arg Phe Phe Val Asn Phe Pro 
40 45 50 

TCG GCC AAG CAG TAC TTC AGC CAG TTT AAA CAC ATG GAG GAT CCG TTG 544 

Ser Ala Lys Gin Tyr Phe Ser Gin Phe Lys His Met Glu Asp Pro Leu 
55 60 65 70 

GAG ATG GAG AGG AGT CCT CAG CTG CGG AAA CAT GCC TGC CGG GTC ATG 592 
Glu Met Glu Arg Ser Pro Gin Leu Arg Lys His Ala Cys Arg Val Met 

75 80 85 

GGG GCC CTC AAC ACG GTC GTG GAG AAC CTG CAT GAC CCA GAC AAG GTA 640 
Gly Ala Leu Asn Thr Val Val Glu Asn Leu His Asp Pro Asp Lys Val 

90 95 100 

TCC TCT GTG CTC GCG CTG GTT GGC AAG GCC CAC GCC CTC AAG CAC AAG 688 
Ser Ser Val Leu Ala Leu Val Gly Lys Ala His Ala Leu Lys His Lys 
105 110 115 

GTG GAA CCT ATG TAC TTT AAG ATT CTC TCT GGG GTC ATT CTG GAC GTG 736 
Val Glu Pro Met Tyr Phe Lys lie Leu Ser Gly Val lie Leu Asp Val 
120 125 130 

ATC GCC GAG GAG TTT GCC AAT GAC TTC CCC GTC GAG ACG CAG AAA GCC 784 
lie Ala Glu Glu Phe Ala Asn Asp Phe Pro Val Glu Thr Gin Lys Ala 
135 140 145 150 

TGG ACC AAG CTG CGC GGT CTC ATC TAC AGC CAT GTG ACC GCG GCC TAC 832 
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Trp Thr Lys Leu Arg Gly Leu lie Tyr Ser His Val Thr Ala Ala Tyr 

155 160 165 



AAG GAA GTG GGC TGG GTA CAG CAG t CCT AGC AGC CAC ACT GCC CTC TTC 
Lys Glu Val Gly Trp Val Gin Gin Pro Thr Ser His Thr Ala Leu Phe 

170 175 180 



880 



AGG GCC ATA GGA CCC CTC CCA GCT CCC CCC TCC CTG GCA GCA CTT TGG 
Arg Ala lie Gly Pro Leu Pro Ala Pro Pro Ser Leu Ala Ala Leu Trp 
185 190 195 



928 



GCA GAA GGC TGACTTCTGA AGACACTCCT TGACCTTCCA TTTCTGGGTG 
Ala Glu Gly 
200 



977 



CCAAGGAAGC TGGAGGAACC CCTGACTGGA CTTCCCGAAA GGAGGCCTTC ATCATGGCTG 1037 

TGGTCTCCCA CGTTACTGGG AGGTGGGGCT GGCTACCTGG ATGCTGAATT TAGCAGAGGG 1097 

GAACCCTTCT GCCTCGCAGC AGAGGGCACA CCCTCCTTAG CTTTCACTGT CAAGAGAGGG 1157 

ACCTGGCATG TGGCCGGTGG GAAGCAAGGA CAAGCCTTTG AGTCACTTCA GAAAGGAACT 1217 

i 

AGCCCTGCTT CACTCTGCCC TACCAGCATG CAGGTTCTCC TCACCATCTC TGAGCATTGC 1277 

GCACACACAC GTCTACCATA TACACAGATA TATTCTATAT A(CGAGCTCTA TATAAATATA 1337 



TATACACACA CACACGTACA TATCTGTCGC GTGCACCTGC AGGCCCAGAA GCCTAGGCCT 1397 

CGCACAGCTG TGGGAGACCC GGAAGAGTCC TGGCCGTACT, TCAGGGCAGC TCTACCGGGA 1457 

GCAGGGAGAA AGCTTGCAAC AGCACGAGAG GGCGGGGCCG GGTGGGAGAG TCGGGGTGAC 1517 

TCAGCTCACC CCGCACAGCT CAGTGCCCTG CAATCTTCCC CTTCCGCCCC CCAAAAGAGA 1577 



ACACTTGGTT TGCCCGGTGT CGGGGCGCAC ACCTGTAATC CCCGAGTTTG GGACGCAGAG 1637 

GAAGGACAAT CTTTGCAAGT CTGAGGTCAG GTGGTCTACA CTTGGTCTCC ATCTTGGTGA 1697 

GTTTCCGGCT AGCTGGGGTC GACTCAGGGA GACCCCGTGT CAAACAAAAC AAGAAGCGAA 1757 

GAGAAGATCT GGCTCCCCTC CCCTCCCCAA GGGTTTGCCC GACTGACTTG CTCCGGAAAG 1817 

GGGCACACCG AGAATTCAGG AGACCCCAGT GGGGTGCTGG CCAGTGTAGG TGTCCGGCGT 1877 

TTGGGGGTGG GGTGAGGGGC AGCTTCAGCG TGGAGCCCTA GAATGTCAGC CTCACCCGCT 1937 

CACTTCAGAC AGCAGCCAGG CCACAGGACC AGGGGCCATA CTCAAACCCC TCAGCAGAGA 1997 

CCAGGGCGCC TCTCTCCTGC CCTCCTCGGG GCCACCTGCC CATCCTCTCC AACTAGTCTC 2057 

CGTAACATTC GCTGGGGACC CCGTGTGTAC CCCTTGGTCA CGTGTACTGT CCTGGAGGAC 2117 

GTCTTACTGC TGTGGCTACC GTGCCTGCGT CCCCTCTGGT CCCACCCTCA CGCAGCCTTC 2177 

CTGTAACTGC CTGTCTTTTT GTAGTTTCTG ATGTTTGTAA CCAGACCCAG CTGTGTCATT 2237 

AAACAGGTCT GTTCTTCCTG TAAAAA 2263 

(2) INFORMATION FOR SEQ ID NO: 81: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 201 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Met Glu Lys Val Pro Gly Asp Met Glu lie Glu Arg Arg Glu Arg Asn 
1 5 10 15 

Glu Glu Leu Ser Glu Ala Glu Arg Lys Ala Val Gin Ala Thr Trp Ala 

20 25 30 

Arg Leu Tyr Ala Asn Cya Glu Asp Val Gly Val Ala lie Leu Val Arg 
35 40 1 45 

Phe Phe Val Asn Phe Pro Ser Ala Lys Gin Tyr Phe Ser Gin Phe Lys 
50 55 60 

His Met Glu Asp Pro Leu Glu Met Glu Arg Ser Pro Gin Leu Arg Lys 
65 70 75 , 80 

His Ala Cys Arg Val Met Gly Ala Leu Asn Thr Val Val Glu Asn Leu 

85 90 95 

i 

His Asp Pro Asp Lys Val Ser Ser Val Leu Ala Leu Val Gly Lys Ala 

100 105 110 

His Ala Leu Lys His Lys Val Glu Pro Met Tyr Phe Lys lie Leu Ser 
115 120 125 . 

Gly Val lie Leu Asp Val He Ala Glu Glu Phe Ala Asn Asp Phe Pro 
130 135 140 

Val Glu Thr Gin Lys Ala Trp Thr Lys Leu Arg Gly Leu He Tyr Ser 
145 150 155 160 

His Val Thr Ala Ala Tyr Lys Glu Val Gly Trp Val Gin Gin Pro Thr 

165 170 175 

Ser His Thr Ala Leu Phe Arg Ala He Gly Pro Leu Pro Ala Pro Pro 

180 185 190 

Ser Leu Ala Ala Leu Trp Ala Glu Gly 
195 200 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1740 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL : NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW092 
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(ix) FEATURE: 

(A) NAME/KEY: miec„feature 

(B) LOCATION: complement {303.. 566) 

(D) OTHER INFORMATION : /labels SAC_24320 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 



GTCGACCTCG 


GAGCTGGGGA 


CCGAACCCAG 


GGCCTTGCGC TTGCTAGGCA 


AGCGCTCTAC 


60 


CACTGAGCTA 


AATCCCCAAC 


CCCTAGCCCT 


TTGTTTTTTA AGAAAGTCTA 


GTGTAGCCCA 


120 


GGTTGGCCTT 


AAATTCCTGA 


TCCTCCTGCC 


CCTATCTGTA ATCTTGGGAT 


TATAGGTGTG 


180 


GGCCATCACA 


CTGTTTATCC 


AGTGbTGAAT 


CAAACCCAGA ATCTTCCACA 


TGCTAGGCGA 


240 


GCAGTCTCCT AAATACTACA CGGCTATAGG TTCAGCCTAG ATCTCTCTTT TGAAGCTCTC 

i 


300 


GTACACCTAT 


AGGGGCCACT 


AATCCTAAAG 


ACGCAGAGGA TGAAAACCAC 


CAGGCCCAAG 


360 


GGTTAACGTG GAAAACAGAG CCCTGGTGTC CCCCTGGGGA ACCAGCCCTT CATGTTCTAA 


420 


GACAGTCATC 


ACCCAGATTC 


CTGAAAACAA 


TAGCGGCTTC TTGTTTAGAA 


CGGAACCGCA 


480 


CCTGCTGGAA 


TACGTGTCAA 


CAAACAATCG 


CTTTpTTTTT CATATTCGCC 


ATCCAAGCTG 


540 


AGGCAATGGG ACTTCCTGAA GCGAGTACTG GAAGACTTAT TCTTAGAAGA GGACAGAATT 


600 


CTCATTTTAA 


CACGAGGCTC 


AGGGGCCGAG 


TGCTGGCCAT CTTCCTTCCT 


TCTTTTGTAC 


660 


TCCCTGGTGT 


TCATCTTATT 


TGGTTTTCAA 


CAGGTTTATT TCCATTGCAG 


CCCGGGGCTT 


720 


CCAACGCCCA 


AAAACGCCCA GCCCACGATG 


CCGTAAATGG TGAGCACGCT 


GGCTTCTGCT 


780 


TCCTCTCCCT 


GAGAACCCAG 


ACCCTTTGTT 


GTTCCGCCAT TCCAAAAGGC 


CCTGGGAAGG 


840 


TTCTTTGCTC 


CTCTGGTGAC 


AGGCTTGGGA 


TGAAGACTTG GGAACTCCTA 


TTTGCTAAAC 


900 


TCTCGCTCCT 


GGCTAAATCT 


TTCTGCTTGT 


TCTTTTTCCT CGTCTAATCA 


TGCTACCTGC 


960 


TACTACTTTT 


CCTGATGCAG 


TCTGACGAAC 


GTCTGTTGAA CACATCGGGA 


CAGTTTTAAA 


1020 


AGGAGTTATT 


TTGGTGGCAA 


AGGCGCTCAG 


TTGTGATTTA AAAGTGTTTT AAGTGATTCA 


1080 


TCCAAGGTTG 


TCTCCATGGA 


CTCCGGAAAT 


AACCACAGCC AAAATACCCC 


GGCGAGAGCA 


1140 


GAATATGAGA 


GGCAGATAGC 


ACCTATTATT 


ACTTTTATCT GGGTGCAGCC 


GGGCTCCAGA 


1200 


TTAATGGGGG 


CAAGCAAAGG 


TCCAGGCCTC 


AGACATGGCC AATTTTAAGC 


AAACTAGTGC 


1260 


TAGTGTTGAT 


AAGTATTCAA 


TAAGGGCTTG 


GAATCTAATT ATTTGTAAGA 


AGTGTCTTTG 


1320 


TTTCCTGATC 


TGTAAAGGAG 


ATGGGGAAAT 


ACTGTGATTT CGTGCCATTT 


TAAAAAAAAT 


1380 


GAAAAAGCAA 


CATAAAAAAA 


TGCAAATACA 


GAAGAAAAGA TGTGGGGTTT 


GAAGTTGGCT 


1440 


TTAGAGAGCT 


GTGTAACCTT 


GGGGAAGTCA 


CCTCGTCTCT CTGTGACTTG 


AATTCCTAAT 


1500 


CAACAATAAT 


GCTGGGGGAC 


AAGGTGCTCT 


CCAAATTCCC CTTCAGATCA 


GAAAGTCTGG 


1560 


GATAACTATC 


CCAGATAGAT 


CCAAACTTGT 


AAAGGAATCA AAAGGAATAG 


TCTAGTGTGA 


1620 


CAGATAACAC 


GGACTTGAAA 


CTTAGTTCAC 


CTGATATTTC TTAATGACTT 


AACACCACTA 


1680 
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TACCTCACCC CAGAGCTAGC CTGTCCCCTG TGTGCCCACC AGTAACCTTG AAATGTCGAC 1740 



(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1385 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW093 

(ix) FEATURE: 

(A) NAME /KEY: CDS , 

(B) LOCATION : 8.. 1357 

(ix) FEATURE: 

(A) NAME/ KEY: misc.f eature < 

(B) LOCATION: 798.. 1385 

(D) OTHER INFORMATION: /label= SAq_24538 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

GTCGACT CTG CCT CAG CCG CTG CCG CCG CCA CTG CTG CTG CAG GAT CCG 49 
Leu Pro Gin Pro Leu Pro Pro Pro Leu Leu Leu Gin Asp Pro 
15 10 

CCG CAG CCA CCA GTC TCG CGC GTC GTC GCT CCC TCC TTG GAC AGT AAT 97 
Pro Gin Pro Pro Val Ser Arg Val Val Ala Pro Ser Leu Asp Ser Asn 
15 20 25 30 

TTT ATG AAT AAG CAT CAG AAG CCG GTA CTA ACA GGC CAG CGG TTC AAA 145 
Phe Met Asn Lys His Gin Lys Pro Val Leu Thr Gly Gin Arg Phe Lys 

35 40 45 

ACC CGG AAA AGG GAT GAA AAA GAG AAA TTC GAA CCC ACA GTC TTC AGG 193 
Thr Arg Lys Arg Asp Glu Lys Glu Lys Phe Glu Pro Thr Val Phe Arg 

50 55 60 

GAT ACA CTT GTC CAG GGG CTT AAT GAA GCT GGT GAT GAC CTT GAA GCT 241 
Asp Thr Leu Val Gin Gly Leu Asn Glu Ala Gly Asp Asp Leu Glu Ala 
65 70 75 

GTA GCC AAA TTC TTG GAC TCT ACT GGC TCA CGA CTA GAT TAT CGT CGC 289 
Val Ala Lys Phe Leu Asp Ser Thr Gly Ser Arg Leu Asp Tyr Arg Arg 
80 85 90 

TAT GCA GAC ACA CTC TTT GAT ATC CTG GTG GCT GGC AGC ATG CTT GCC 337 
Tyr Ala Asp Thr Leu Phe Asp lie Leu Val Ala Gly Ser Met Leu Ala 
95 100 105 110 

CCT GGA GGA ACA CGC ATA GAT GAT GGT GAC AAG ACC AAG ATG ACC AAC 385 
Pro Gly Gly Thr Arg He Asp Asp Gly Asp Lys Thr Lys Met Thr Asn 

115 120 125 
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CAC TGT GTG TTT TCA GCA AAT GAA GAT CAT GAA ACC ATC CGA AAC TAT 433 
His Cys Val Phe Ser Ala Asn Glu Asp His Glu Thr lie Arg Asn Tyr 

130 135 140 

GCT CAG GTC TTC AAC AAA CTC ATC AGG AGA TAC AAA TAT TTG GAA AAG 481 
Ala Gin Val Phe Asn Lys Leu lie Arg Arg Tyr Lys Tyr Leu Glu Lys 
145 150 155 

GCA TTT GAA GAT GAA ATG AAA AAG CTT CTC CTC TTC CTT AAA GCA TTT 529 
Ala Phe Glu Asp Glu Met Lys Lys Leu Leu Leu Phe Leu Lys Ala Phe 
160 165 170 

TCT GAA GCA GAG CAG ACA AAG TTG GCA ATG CTG TCT GGG ATC CTG TTG 577 
Ser Glu Ala Glu Gin Thr Lys Leu Ala Met Leu, Ser Gly lie Leu Leu 
175 180 185 190 

t 

GGC AAT GGA ACC CTC CCG GCC ACC ATC CTT ACC AGT CTC TTC ACC GAC 625 
Gly Asn Gly Thr Leu Pro Ala Thr lie Leu Thr Ser Leu Phe Thr Asp 

195 200 205 

AGC TTA GTC AAA GAA GGC ATC GCA GCC TCG TTC GCT GTC AAG CTT TTC 673 
Ser Leu Val Lys Glu Gly lie Ala Ala Ser Phe Ala Val Lys Leu Phe 

210 215 220 

AAA GCC TGG ATG GCA GAG AAA GAT GCC AAT TCT GTC ACC TCT TCA TTG 721 
Lys Ala Trp Met Ala Glu Lys Asp Ala Asn Ser Val Thr Ser Ser Leu 
225 230 , 235 

AGA AAA GCC AAC CTG GAC AAG AGG CTG CTT GAA CTC TTC CCA GTG AAC 769 
Arg Lys Ala Asn Leu Asp Lys Arg Leu Leu Glu Leu Phe Pro Val Asn 
240 245 250 

AGA CAG AGT GTG GAT CAT TTC GCC AAG TAC TTC ACT GAC GCA GGA CTG 817 
Arg Gin Ser Val Asp His Phe Ala Lys Tyr Phe Thr Asp Ala Gly Leu 
255 260 265 270 

AAG GAG CTG TCA GAC TTT CTC CGA GTC CAG CAG TCA CTG GGC ACC AGG 865 
Lys Glu Leu Ser Asp Phe Leu Arg Val Gin Gin Ser Leu Gly Thr Arg 

275 280 285 

AAG GAA CTG CAG AAG GAG CTG CAA GAG CGA CTG TCT CAG GAA TGC CCC 913 
Lys Glu Leu Gin Lys Glu Leu Gin Glu Arg Leu Ser Gin Glu Cys Pro 

290 295 300 

ATC AAG GAG GTG GTG CTT TAC GTC AAA GAG GAA ATG AAA AGG AAC GAC 961 
He Lys Glu Val Val Leu Tyr Val Lys Glu Glu Met Lys Arg Asn Asp 
305 310 315 

CTT CCG GAG ACA GCT GTG ATC GGG CTG CTG TGG ACC TGC GTC ATG AAC 1009 
Leu Pro Glu Thr Ala Val He Gly Leu Leu Trp Thr Cys Val Met Asn 
320 325 330 

GCT GTG GAA TGG AAC AAG AAG GAG GAG CTG GTT GCT GAG CAG GCT CTT 1057 
Ala Val Glu Trp Asn Lys Lys Glu Glu Leu Val Ala Glu Gin Ala Leu 
335 340 345 350 

AAG CAC CTG AAG CAA TAC GCT CCC CTG CTG GCC GTG TTC AGT TCC CAA 1105 
Lys His Leu Lys Gin Tyr Ala Pro Leu Leu Ala Val Phe Ser Ser Gin 

355 360 365 

GGC CAG TCA GAG CTT GTC CTC CTG CAG AAG GTT CAG GAA TAC TGC TAT 1153 
Gly Gin Ser Glu Leu Val Leu Leu Gin Lys Val Gin Glu Tyr Cys Tyr 
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370 375 380 

GAC AAC ATC CAC TTC ATG AAA GCC TTC CAG AAG ATC GTG GTT CTC TTT 1201 
Asp Asn He His Phe Met Lys Ala Phe Gin Lys He Val Val Leu Phe 
385 390 395 

TAT AAA GCC GAC GTT CTG AGC GAG GAG GCG ATA CTG AAG TGG TAT AAA 1249 
Tyr Lys Ala Asp Val Leu Ser Glu Glu Ala He Leu Lys Trp Tyr Lvs 
400 405 410 

GAA GCA CAT GCC GCC AAA GGC AAA AGC GTC TTC CTT GAC CAG ATG AAG 1297 
Glu Ala His Ala Ala Lys Gly Lys Ser Val Phe Leu Asp Gin Met Lys 
415 420 425 430 

AAA TTT GTG GAG TGG TTA CAA AAT GCA GAA GAA GAA TCT GAA TCA GAA 1345 
Lys Phe Val Glu Trp Leu Gin Asn Ala Glu Glu Glu Ser Glu Ser Glu 

435 440 445 

GGT GAA GAA AGC TAAATGTCTC CAGCACACTG CCTAAAAA 1385 
Gly Glu Glu Ser 

450 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS : 1 

(A) LENGTH: 450 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

Leu Pro Gin Pro Leu Pro Pro Pro Leu Leu Leu Gin Asp Pro Pro Gin 
15 10 15 

Pro Pro Val Ser Arg Val Val Ala Pro Ser Leu Asp Ser Asn Phe Met 

20 25 30 

Asn Lys His Gin Lys Pro Val Leu Thr Gly Gin Arg Phe Lys Thr Arg 
35 40 45 

Lys Arg Asp Glu Lys Glu Lys Phe Glu Pro Thr Val Phe Arg Asp Thr 
50 55 60 

Leu Val Gin Gly Leu Asn Glu Ala Gly Asp Asp Leu Glu Ala Val Ala 
65 70 75 80 

Lys Phe Leu Asp Ser Thr Gly Ser Arg Leu Asp Tyr Arg Arg Tyr Ala 

85 90 95 

Asp Thr Leu Phe Asp He Leu Val Ala Gly Ser Met Leu Ala Pro Gly 

100 105 no 

Gly Thr Arg He Asp Asp Gly Asp Lys Thr Lys Met Thr Asn His Cys 
115 120 125 

Val Phe Ser Ala Asn Glu Asp His Glu Thr He Arg Asn Tyr Ala Gin 
130 135 140 

Val Phe Asn Lys Leu lie Arg Arg Tyr Lys Tyr Leu Glu Lys Ala Phe 
145 150 155 160 
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Glu Asp Glu Met Lys Lys Leu Leu Leu Phe Leu Lys Ala Phe Ser Glu 

165 170 175 

Ala Glu Gin Thr Lys Leu Ala Met Leu Ser Gly lie Leu Leu Gly Asn 

180 185 190 

Gly Thr Leu Pro Ala Thr He Leu Thr Ser Leu Phe Thr Asp Ser Leu 
195 200 205 

Val Lys Glu Gly He Ala Ala Ser Phe Ala Val Lys Leu Phe Lys Ala 
210 215 220 

Trp Met Ala Glu Lys Asp Ala Asn Ser Val Thr Ser Ser Leu Arg Lys 
225 230 ' 235 240 

Ala Asn Leu Asp Lys Arg Leu Leu Glu Leu Phe Pro Val Asn Arg Gin 

, 245 250 255 

Ser Val Asp His Phe Ala Lys Tyr Phe Thr Asp Ala Gly Leu Lys Glu 

260 265 270 

Leu Ser Asp Phe Leu Arg Val Gin Gin Ser Leu Gly Thr Arg Lys Glu 
275 280 285 

Leu Gin Lys Glu Leu Gin Glu Arg Leu Ser Gin Glu Cys Pro He Lys 
290 \ 295 300 

Glu Val Val Leu Tyr Val Lys Glu Glu Met Lys Arg Asn Asp Leu Pro 
305 310 315 320 

Glu Thr Ala Val He Gly Leu Leu Trp Thr Cys Val Met Asn Ala Val 

325 330 335 

Glu Trp Asn Lys Lys Glu Glu Leu Val Ala Glu Gin Ala Leu Lys His 

340 345 350 

Leu Lys Gin Tyr Ala Pro Leu Leu Ala Val Phe Ser Ser Gin Gly Gin 
355 360 365 

Ser Glu Leu Val Leu Leu Gin Lys Val Gin Glu Tyr Cys Tyr Asp Asn 
370 375 380 

He His Phe Met Lys Ala Phe Gin Lys He Val Val Leu Phe Tyr Lys 
385 390 395 400 

Ala Asp Val Leu Ser Glu Glu Ala He Leu Lys Trp Tyr Lys Glu Ala 

405 410 415 

His Ala Ala Lys Gly Lys Ser Val Phe Leu Asp Gin Met Lys Lys Phe 

420 425 430 

Val Glu Trp Leu Gin Asn Ala Glu Glu Glu Ser Glu Ser Glu Gly Glu 
435 440 445 

Glu Ser 
450 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1463 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW094 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 255.. 1238 

(ix) FEATURE: « 

(A) NAME /KEY: roisc_feature 

(B) LOCATION: 1050.. 1355 ! 

(D) OTHER INFORMATION: /label* SAC 23896 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:85: ( 

GTCGACCCCG AACACCCACA CTCGGTCCCC AGCCGTTTTT CTCCCAGAGA TCCGGAAGCC 60 

CCGACGTCGT GCCCTACACC AAAGATCGAG AAAACCACTC AGACGCGGCT CCTAGGCGGC 120 

AGTTCGGTTC CCATAGCAAC CCTCGGAGCA CCCGCATCCC GCTTCCGGCC GGGTGGCTCG 180 

GGCGCTCCCG TGACGTCACG GCGCCGCTTT CCGGTGACTb AGTCGAGCTG AGGTTGTTGT 240 

GGGCCGGGGG CGCC ATG GGG GCG ACT GGC GAC ACC GAG CAG CCG CGG GGC 290 

Met Gly Ala Thr Gly Asp Thr Glu Gin Pro Arg Gly 
15 10 



CCC GGC GGG GCG GAG CGA GGC GGC CTG GAG CTG GGC GAC GCG GGC GCG 
Pro Gly Gly Ala Glu Arg Gly Gly Leu Glu Leu Gly Asp Ala Gly Ala 
15 20 25 

GCG GGC CAG CCG GTT CTC ACG AAC CCT TGG AAC ATA ATG ATC AAA CAC 
Ala Gly Gin Pro Val Leu Thr Asn Pro Trp Asn He Met He Lys His 
30 35 40 

CGG CAG GTG CAG CGA AGA GGC CGC CGA TCT CAG ATG ACC ACA AGT TTC 
Arg Gin Val Gin Arg Arg Gly Arg Arg Ser Gin Met Thr Thr Ser Phe 
45 50 55 60 

ACA GAT CCA GCC ATC TCT ATG GAT CTC CTC CGT GCT GTC CTG CAG CCT 
Thr Asp Pro Ala He Ser Met Asp Leu Leu Arg Ala Val Leu Gin Pro 

65 70 75 

AGC ATC AAT GAG GAG ATC CAG AGT GTC TTC AAC AAG TAC ATG AAG TTC 
Ser He Asn Glu Glu lie Gin Ser Val Phe Asn Lys Tyr Met Lys Phe 

80 85 90 

TTC CAG AAG GCA GCG CTG AAT GTG CGA GAC AAC GTT GGT GAA GAG GTG 
Phe Gin Lys Ala Ala Leu Asn Val Arg Asp Asn Val Gly Glu Glu Val 
95 100 105 

GAT GCA GAA CAG TTG ATT CAG GAG GCC TGC CGC AGC TGC CTG GAG CAG 
Asp Ala Glu Gin Leu He Gin Glu Ala Cys Arg Ser Cys Leu Glu Gin 
110 115 120 



338 



386 



434 



482 



530 



578 



626 
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GCA AAG CTA CTC TTT TCA GAT GGA GAA AAA GTG ATA CCC AGA TTG GCC 674 
Ala Lys Leu Leu Phe Ser Asp Gly Glu Lys Val lie Pro Arg Leu Ala 
125 130 135 140 

CAT GAA CTT CCA GGG ATC AAG CGT GGC CGG CAG GCA GAA GAG GAG TCC 722 
His Glu Leu Pro Gly lie Lys Arg Gly Arg Gin Ala Glu Glu Glu Ser 

145 150 155 

CAC CGA GGA AGC CCC ATT CCC AAA AAG AGG AAA GGT CGG CCT CCT GGA 770 
His Arg Gly Ser Pro lie Pro Lys Lys Arg Lys Gly Arg Pro Pro Gly 

160 165 170 

CAT GTC CTG TCA AAT GAC CGC GCA GCT GCT GGC ATG GTA TGG AAA CCA 818 
His Val Leu Ser Asn Asp Arg Ala Ala Ala Gly Met Val Trp Lys Pro 
175 180 185 

AAA TCC TGT GAA CCA ATT CGC CGA GAA GGC CCC AAG TGG GAC CCA GCT 866 
Lys Ser Cys Glu Pro lie Arg Arg Glu Gly Pro Lys Trp Asp Pro Ala 
190 195 200 

CGG CTG AAT GAA TCT ACC ACC TTT GTT TTG GGG TCT CGA GCC AAC AAG 914 
Arg Leu Asn Glu Ser Thr Thr Phe Val Leu Gly [Ser Arg Ala Asn Lys 
205 210 215 220 

GCC TTA GGG ATG GGA GGC ACC AGA GGG AGG ATT TAC ATC AAG CAC CCA 962 
Ala Leu Gly Met Gly Gly Thr Arg Gly Arg lie Tyr lie Lys His Pro 

225 230 235 

CAC CTC TTT AAG TAT GCA GCA GAT CCT CAG GAP AAG CAC TGG CTG GCT 1010 
His Leu Phe Lys Tyr Ala Ala Asp Pro Gin Asp Lys His Trp Leu Ala 

240 245 t 250 

GAG CAG CAT CAT ATG CGG GCA ACA GGA GGA AAG ATG GCG TAC CTT CTC 1058 
Glu Gin His His Met Arg Ala Thr Gly Gly Lys Met Ala Tyr Leu Leu 
255 260 265 

ATT GAG GAA GAC ATC CGA GAC TTG GCT GCC AGC GAT GAC TAC AGA GGA 1106 
lie Glu Glu Asp He Arg Asp Leu Ala Ala Ser Asp Asp Tyr Arg Gly 
270 275 280 

TGC TTG GAC CTG AAG TTG GAG GAG CTG AAG TCC TTT GTT TTG CCA TCC 1154 
Cys Leu Asp Leu Lys Leu Glu Glu Leu Lys Ser Phe Val Leu Pro Ser 
285 290 295 300 

TGG ATG GTT GAG AAG ATG CGG AAA TAC ATG GAG ACA CTA CGG ACA GAA 1202 
Trp Met Val Glu Lys Met Arg Lys Tyr Met Glu Thr Leu Arg Thr Glu 

305 310 315 

AAT GAG CAT CGT GCT GCT GAA GCA ACT CCC CAG ACC TGAGCCGAGT 1248 
Asn Glu His Arg Ala Ala Glu Ala Thr Pro Gin Thr 

320 325 

GTCCTGGCTA CTACACTTGG CAGTCTGCCT CCCAGACCCT CTTTCCCCGC CCGGCTGAGG 1308 

CCATCATGGG GATGCGGTCT AGTTGGCTCT TAGCAGCATC AAGCTGTACA TGAGCTAGTT 1368 

TGTAGTGACT CACTGCAGAG CCCCCCAGAC TGGCTTGTGG TTCTGTTTCT AAAGTTATTG 1428 

GAATAAGAAG CAATTAAACA AGTTTGTAAT AAAAA 1463 



(2) INFORMATION FOR SEQ ID NO: 86: 
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li) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 328 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

Met Gly Ala Thr Gly Asp Thr Glu Gin Pro Arg Gly Pro Gly Gly Ala 
15 10 15 

Glu Arg Gly Gly Leu Glu Leu Gly Asp Ala Gly Ala Ala Gly Gin Pro 

20 25 30 

Val Leu Thr Asn Pro Trp Asn He Met He Lys His Arg Gin Val Gin 
35 40 45 

Arg Arg Gly Arg Arg Ser Gin Met Thr Thr Ser Phe Thr Asp Pro Ala 
50 55 60 

He Ser Met Asp Leu Leu Arg Ala Val Leu Gin Pro Ser He Asn Glu 
65 70 75 80 

Glu He Gin Ser Val Phe Asn Lys Tyr Met Lys Phe Phe Gin Lys Ala 

85' '90 95 

Ala Leu Asn Val irg Asp Asn Val Gly Glu Glu Val Asp Ala Glu Gin 

100 105 110 

i 

Leu He Gin Glu Ala Cys Arg Ser Cys Leu Glu Gin Ala Lys Leu Leu 
115 120 . 125 

Phe Ser Asp Gly Glu Lys Val He Pro Arg Leu Ala His Glu Leu Pro 
130 135 140 

Gly lie Lys Arg Gly Arg Gin Ala Glu Glu Glu Ser His Arg Gly Ser 
145 150 155 160 

Pro He Pro Lys Lys Arg Lys Gly Arg Pro Pro Gly His Val Leu Ser 

165 170 175 

Asn Asp Arg Ala Ala Ala Gly Met Val Trp Lys Pro Lys Ser Cys Glu 

180 185 190 

Pro He Arg Arg Glu Gly Pro Lys Trp Asp Pro Ala Arg Leu Asn Glu 
195 200 205 

Ser Thr Thr Phe Val Leu Gly Ser Arg Ala Asn Lys Ala Leu Gly Met 
210 215 220 

Gly Gly Thr Arg Gly Arg He Tyr He Lys His Pro His Leu Phe Lys 
225 230 235 240 

Tyr Ala Ala Asp Pro Gin Asp Lys His Trp Leu Ala Glu Gin His His 

245 250 255 

Met Arg Ala Thr Gly Gly Lys Met Ala Tyr Leu Leu He Glu Glu Asp 

260 265 270 

He Arg Asp Leu Ala Ala Ser Asp Asp Tyr Arg Gly Cys Leu Asp Leu 
275 280 285 
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Lys Leu Glu Glu Leu Lys Ser Phe Val Leu Pro Ser Trp Met Val Glu 
290 295 300 

Lys Met Arg Lys Tyr Met Glu Thr Leu Arg Thr Glu Asn Glu His Arg 
305 310 315 320 

Ala Ala Glu Ala Thr Pro Gin Thr 

325 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1276 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double , 

(D) TOPOLOGY: linear 

i 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW095 

(ix) FEATURE: 1 

(A) NAME /KEY: CDS 

(B) LOCATION: 7.. 876 

(ix) FEATURE: 

(A) NAME /KEY: misc_feature 

(B) LOCATION: complement (631.. 798) 

(D) OTHER INFORMATION: /label* SAC„23802 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

GTCGAC ATT TTC TCA GAA TCC TAC GGA GGA AAG ATG GCT GCT GGC ATC 48 
lie Phe Ser Glu Ser Tyr Gly Gly Lys Met Ala Ala Gly lie 

15 10 

AGT TTA GAA CTT CAC AAG GCT ATT CAG CAA GGG ACC ATC AAG TGC AAC 96 
Ser Leu Glu Leu His Lys Ala He Gin Gin Gly Thr He Lys Cys Asn 
15 20 25 30 

TTC TCT GGG GTT GCT TTG GGT GAC TCC TGG ATC TCC CCT GTG GAT TCA 144 
Phe Ser Gly Val Ala Leu Gly Asp Ser Trp He Ser Pro Val Asp Ser 

35 40 45 

GTG CTG TCC TGG GGA CCT TAC CTG TAC AGC GTG TCT CTC CTT GAT AAT 192 
Val Leu Ser Trp Gly Pro Tyr Leu Tyr Ser Val Ser Leu Leu Asp Asn 

50 55 60 

AAA GGC TTG GCT GAG GTG TCC GAC ATT GCG GAG CAA GTC CTC AAT GCT 240 
Lys Gly Leu Ala Glu Val Ser Asp He Ala Glu Gin Val Leu Asn Ala 
65 70 75 

GTA AAC AAG GGC TTC TAC AAG GAA GCC ACT CAG CTG TGG GGG AAA GCA 288 
Val Asn Lys Gly Phe Tyr Lys Glu Ala Thr Gin Leu Trp Gly Lys Ala 
80 85 90 

GAA ATG ATC ATT GAA AAG AAC ACC GAC GGG GTA AAC TTC TAT AAC ATC 336 
Glu Met lie He Glu Lys Asn Thr Asp Gly Val Asn Phe Tyr Asn lie 
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95 100 105 110 

! 

TTA ACT AAA AGC ACC CCC GAC ACC TCT ATG GAG TCG AGC CTC GAG TTC 384 
Leu Thr Lys Ser Thr Pro Asp Thr Ser Met Glu Ser Ser Leu Glu Phe 

115 120 125 

TTC CGG AGC CCC TTA GTT CGT CTC TGT CAG CGC CAC GTG AGA CAC CTA 432 
Phe Arg Ser Pro Leu Val Arg Leu Cys Gin Arg His Val Arg His Leu 

130 135 140 

CAA GGA GAC GCC TTA AGT CAG CTC ATG AAC GGT CCC ATC AAA AAG AAG 480 
Gin Gly Asp Ala Leu Ser Gin Leu Met Asn Gly Pro lie Lys Lys Lys 
145 150 155 

CTC AAA ATT ATC CCT GAC GAT GTC TCC TGG GGA GCC CAG TCA TCC TCC 528 
Leu Lys lie lie Pro Asp Asp Val Ser Trp Gly Ala Gin Ser Ser Ser 
160 165 170 

GTC TTC ATA AGC! ATG GAA GAG GAC TTC ATG AAG CCT GTC ATC GAC ATC 576 
Val Phe lie Ser Met Glu Glu Asp Phe Met Lys Pro Val He Asp He 
175 180 185 190 

GTG GAT ACG TTG CTG GAA CTC GGG GTC AAT GTG ACT GTG TAC AAT GGG 624 
Val Asp Thr Leu Leu Glu Leu Gly Val Asn Val Thr Val Tyr Asn Gly 

195 1 200 205 

CAG CTG GAT CTC ATT GTG GAC ACC ATA GGT CAG GAG TCC TGG GTT CAG 672 
Gin Leu Asp Leu lie Val Asp Thr He Gly Gin Glu Ser Trp Val Gin 

210 215 220 

AAG CTG AAG TGG CCA CAG CTG TCC AGA TTC AAT CAG CTA AAA TGG AAG 720 
Lys Leu Lys Trp Pro Gin Leu Ser Arg Phe Asn Gin, Leu Lys Trp Lys 
225 230 235 

GCC CTG TAC ACC AAT CCT AAG TCT TCA GAA ACA TCT GCG TTT GTC AAG 768 
Ala Leu Tyr Thr Asn Pro Lys Ser Ser Glu Thr Ser Ala Phe Val Lys 
240 245 250 

TCC TAT GAG AAC CTA GCG TTC TAC TGG ATC CTA AAG GCG GGT CAC ATG 816 

Ser Tyr Glu Asn Leu Ala Phe Tyr Trp He Leu Lys Ala Gly His Met 
255 260 265 270 

GTT CCT GCT GAC CAA GGG GAC ATG GCT CTG AAG ATG ATG AGG CTG GTT 864 
Val Pro Ala Asp Gin Gly Asp Met Ala Leu Lys Met Met Arg Leu Val 

275 280 285 

ACT CAG CAG GAG TAGCTGAGCT GAGCTGGCCC TGGAGGCCCT GGAGGCCCTG 916 
Thr Gin Gin Glu 

290 



GAGGCCCTGG 


AGTAGGGCCC 


AGGATGCAGG 


TGCTAATGTC 


TATCCCCGGC 


GCTCTTCTTC 


976 


CCGACTCTAC 


CATGGGATGT 


AACTCCAGGA 


GCCCCTGCCA 


TCTCCGGTAC 


CAAAAGACTG 


1036 


TGGCTTCCGT 


GTCTACTCAG 


AAATCAGTTC 


TACTTCGTAA ACAGTGTTTA AAACCAGACT 


1096 


CATTTAATCA 


GAGTGAAGGA 


TTGCAGTCCA 


TTGGCTTCTT 


AGCACAGAAG 


CAGCTGATAA 


1156 


CACAAGTAAA 


CCCCAGCCCT 


TGAGAGGTAG 


AAGCAAGAGG 


ATCAGAGGTT 


CAAGCGCATC 


1216 


CTCGGCTCCA 


TCACAAGTTC 


AAAAGCCGCC 


TGCACCAAAT 


GGGAGTCCTT 


GTCTCAAAAA 


1276 
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(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 290 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
XX. Phe Ser Glu Ser Tyr Gly Gly Lys Met Ala Ala Gly He Ser Leu 

Oil Leu His Lys Ala He Gin Gin Gly Thr lie Lys Cys As; Phe Ser 



I I 



' , q e r Trt) He Ser Pro Val Asp Ser Val Leu 
Gly Val Ala Leu Gly Asp Ser Trp 45 

3 5 

„ ' r Ser Val Ser Leu Leu Asp Asn Lys Gly 

Ser Trp Gly Pro Tyr Leu Tyr ser va-i. o g() 

50 " 
^ Ala Glu V.1 Ser, Asp U« Ala Glu Gla v.l Leu » «. v.l » 

^ Oly Ph. Tyr Lys Glu Ala Thr Gin Leu Trp Gly I*. A!. Glu -t 

85 , • 

* pi w val Asn Phe Tyr Asn He Leu Thr 
He He Glu Lys Asn Thr Asp Gly Val Asn me y uQ 

100 1U3 

» ^ q er Met Glu Ser Ser Leu Glu Phe Phe Arg 
Lys Ser Thr Pro Asp Thr Ser Met <>au a ^ 

115 " u 

^n- *w* win Val Ara His Leu Gin Gly 
Ser Pro Leu Val Arg Leu Cys Gin Arg His Val Arg 

130 135 
A=p Ala Leu Ser Gl„ Leu AS„ Gly Pro lie Lys W W « Lys 



H. lie Pro Asp Asp Val Ser Trp Gly Ala Gin Ser Ser Ser Val Phe 

165 

He Ser Met Glu Glu Asp Phe Met Lys Pro Val He Asp He Val Asp 

180 

<. „ i val Thr Val Tyr Asn Gly Gin Leu 
Thr Leu Leu Glu Leu Gly Val Asn Val Thr vai ly ^ 

195 2UU 

m . m„ n« ser Trp Val Gin Lys Leu 
Asp Leu lie Val Asp Thr lie Gly Gin Glu Ser Trp 

210 2i:> 

Ly s Tt p Pro G!„ Leu Ser Ar, Ph. Asn Gin Leu Lys Trp Lys Ala Leu 



225 »» 



Qflr e er Glu Thr Ser Ala Phe Val Lys Ser Tyr 
Tyr Thr Asn Pro Lys Ser Ser fciu inr 255 

245 

Glu Asn Leu Al. Ph. Tyr Trp lie Leu Lys Al. G ly Hi. M V.l Pro 

260 263 
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Ala Asp Gin Gly Asp Met Ala Leu Lys Met Met Arg Leu Val Thr Gin 
275 280 285 

Gin Glu 
290 

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; cDNA 
Uii) HYPOTHETICAL; NO 



<vii) IMMEDIATE SOURCE: 
(B) CLONE: HW096 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 295*. 1302 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

GTCGACGTCT GATTTTACAT TCATGCAGTT GGGAGTAAGA GGAAAAAGAT TCTGAAGTGT 60 

GGGGAACTTC ACAAAGAGGG GAACAAAGTC TGTGTCTATG GCTCCAAGGG AGAGAACATC 120 

AGAGACACTC TGAGGAAGGA CGGCAGGTTT TGTTCTTTCA TAGAGAATAA CCATTGGAAA 180 

CTCATTAATG ACCTGGACAC TATCATAGAA AACACTCAGC CATTTGATGA GTTGGAGGGC 240 

AAGCTCTTTC AGGTTGCAGC TGAGCTACCA AAGAACCCTA GGGTAGTCTC TGTC ATG 297 

Met 

1 

CAG AAT TCT GGG TTA GAG AAC AGA GAC TTC CAT AAG CTA GAA GAC TAT 345 
Gin Asn Ser Gly Leu Glu Asn Arg Asp Phe His Lys Leu Glu Asp Tyr 

5 10 15 

ATT GTG AAT GCG AAC CCT ACA TTG AAA GAA GAA GGA GAA AAA CTC AGA 393 
lie Val Asn Ala Asn Pro Thr Leu Lys Glu Glu Gly Glu Lys Leu Arg 
20 25 30 

GCA TAC ATC AAG AAA GAA AGT AAA AAA AGA AAG AAA GGT TCC TTA TTC 441 
Ala Tyr He Lys Lys Glu Ser Lys Lys Arg Lys Lys Gly Ser Leu Phe 
35 40 45 

AAA GTG CAT AAA GAA CAC TTT GGG AAA ATA ACA AAA AAT TCT ACT CCT 489 
Lys Val His Lys Glu His Phe Gly Lys He Thr Lys Asn Ser Thr Pro 
50 55 60 65 

GTT AAA GTG CAC AAA CAT CTT TCG AAA GTC AGT GAT TCA GTT GGG TTC 537 
Val Lys Val His Lys His Leu Ser Lys Val Ser Asp Ser Val Gly Phe 

70 75 80 

CTA TGG TGG AAC AAC AAT GGA AAG GAG GGC TGT GCC ACC TGC TTT GTT 585 
Leu Trp Trp Asn Asn Asn Gly Lys Glu Gly Cys Ala Thr Cys Phe Val 
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* 

TTT AAA GGA TTA TAC ATT TTG ACT TGC CGG CAT GTG ATA ACT AGT ATT 633 
Phe Lys Gly Leu Tyr lie Leu Thr Cys Arg His Val lie Thr Ser He 
100 105 110 

GTG GGT GAA GGC ATA GAT CCA TGT CAG TGG GCA AGC TTA ATT AGT CAG 4 681 
Val Gly Glu Gly He Asp Pro Cys Gin Trp Ala Ser Leu He Ser Gin 
115 120 125 

TGT GTA AAG GTG ACC TTT GGT TAT GAA GAG TTC CCA CTA GTA GAA GAC 729 
Cys Val Lys Val Thr Phe Gly Tyr Glu Glu Phe Pro Leu Val Glu Asp 
130 135 140 145 

AAG TTT TTT GAG GTT AAA CCT TGG TTT GAG ATA f TCT AAT GAA AGC CTT 777 
Lys Phe Phe Glu Val Lys Pro Trp Phe Glu He Ser Asn Glu Ser Leu 

150 155 160 

GAC TAT GCT GTC CTA GAA CTG AGG GAA AAT GGA CAA GAA GTT, CCT GCT 825 
Asp Tyr Ala Val Leu Glu Leu Arg Glu Asn Gly Gin Glu Val Pro Ala 

165 170 175 

i 

GGA CTG TAT AAT GGA ATA GGA CCT GTG CCA CTT GGT GGG TTA ATA TAT 873 
Gly Leu Tyr Asn Gly He Gly Pro Val Pro Leu Gly Gly Leu He Tyr 
180 185 190 

ATC ATT GGC CAT CCT GAT GGA GAA AAG AAG TCT ACT GAT GGC TGT GCA 921 
He He Gly His Pro Asp Gly Glu Lys Lys Ser Thr Asp Gly Cys Ala 
195 200 f 205 

GTG GTC CCT CAA AGT CAT AGA GGG CAA AAA TGT CAG GAA AAT TTT CAA 969 
Val Val Pro Gin Ser His Arg Gly Gin Lys Cys Gin Glu Asn Phe Gin 
210 215 220 225 

GCA AGA GAG AAA GCA AAC TAC TGG ATT TCT AC A TCT TTT TTT CCT ATA 1017 
Ala Arg Glu Lys Ala Asn Tyr Trp He Ser Thr Ser Phe Phe Pro He 

230 235 240 

TAC ACA CAA AGA AGT TTC CAC GAA GTC TTT CAC AAC CAT GAT GTG GTT 1065 
Tyr' Thr Gin Arg Ser Phe His Glu Val Phe His Asn His Asp Val Val 

245 250 255 

ACT TAT GAC ACC ACT TTT TTT GGT GGG TCT TCT GGA TCC CCA GTA TTT 1113 
Thr Tyr Asp Thr Thr Phe Phe Gly Gly Ser Ser Gly Ser Pro Val Phe 
260 265 270 

GAT TCT AAT GGT TCA TTG GTG GCC ATG CAT GCT GCT GGC ATC ACT TGT 1161 
Asp Ser Asn Gly Ser Leu Val Ala Met His Ala Ala Gly He Thr Cys 
275 280 285 

ACA GAC CAG AAT GGA GTT TTT AAT ATC ATT GAG TTT GGT TTT ACT ATG 1209 
Thr Asp Gin Asn Gly Val Phe Asn He He Glu Phe Gly Phe Thr Met 
290 295 300 305 

GAA TCC ATT CTT GCT AAT ATT AAG CAA ACT AAA GAG TGG TAT GAT ACC 1257 
Glu Ser lie Leu Ala Asn He Lys Gin Thr Lys Glu Trp Tyr Asp Thr 

310 315 320 

ATT TTT GTA AAT GAT CGG GAT GCA GAA ATG CTG AGC ACA GAC TCC 1302 
He Phe Val Asn Asp Arg Asp Ala Glu Met Leu Ser Thr Asp Ser 

325 330 335 

TGAGGACTGG AAAGGGTGTA TTCTAGTTCA CAACTTTAAG GGAATTGCCT AAGGTGTTTT 1362 
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TCTTCAGATA 


ATAATGTTTT 


ATGAACTTTC 


AAAATGATTA 


ATTTCACCCA 


ATAATCATAG 


1422 


TTCTAGAGTG 


TATCTTAACT 


ATCAAGCATT 

1 


TTCATGGTTT 


ATTAAAACAA 


ACAATCAAAA 


1482 


ACAAAACTAT 


CAGTTGAACT 


ATTTGTGTAA 


AGTAGCATGA AAAAGAGGAT 


TAAGTGCTGA 


1542 


TGTGGGATCC 


GAGGGTCAGG 


AGATGCTAAG 


ATGTGAAAAA 


GTTGAGAGCA 


TAAACTGGAG 


1602 


TTATTAGCCA 


GAAGATTCAT 


GGAAAAGAAA GAAGCCTACC 


ACGTTCCTTG 


AAACTTAAAC 


1662 


CAATTTCAAA 


CTCACTCATT 


TGGTTCTACC 


CATATCTTCC 


TTCTCTGCTG 


ACCAGAGCTC 


1722 


AGCCAGGACA 


CTGAATCACA 


GCGACTTAAC 

\ 


CCTCCCAGGG 


CCCTGAACAT 


GGTGGCATTT 


1782 


CTTCCATTTT 


GTCTTTTTAA 


l 

CTTAAGACTT 


AGATCTGAAA 


TGAACCCACA 


TGAAGTGGCA 


1842 


TGGAAATCAT 


tgagcaAgca 


AGGAGCTGGT 


CTCTGCCACA 


GAATGCATGT 


ACCTGAGACT 


1902 


TTCTGGACTT 


gagttgtgga 


GAGCAAACAT 


TGGAATAGGT 


GGTTTCTACT 


TGATACGATT 


1962 


CACAATGGAG 


agtgctagga 


GGATATTGGG 


CACAACCTAG 


GACAATTCTC 


AGCAGTGTGA 


2022 


CACTCAAGTC 


ttggtccttt 


GAAGAGAAAG 


AAAACTGCGT 


TCAAGAAGTA 


ATCTCATCAA 


2082 






TATGAACAGA 


AGACCAGAGA 


TCTTAGATTA 

i 


2l(2 & fill 


0 1 AO 


AGTAGGAATA 


gatgagIatgc 


TGAAGGAATG 


TTGCTGGGAT 


CCAGCGATTG 


CTTCAATTCT 


2202 


TCCTTTATCA 


taactgaaat 


ATGTCCTGTA 


TGAGAGCTAA CATGATCCTA 


ATCCACAACT 


2262 


CCCCTTATTT 


tccagacatt 


TCCTAGTGCT 


ATTGTGTGAT 


ATTTTGTGTC 


CCCTGCCCAA 


2322 


TGTATTTTTC 


ccaaatcaaa 


TACAGGTGAA 


TCAATTAAAA 


AACAGTGTAA 


TATCTGTAGT 


2382 


AGGTTTTGAA 


ATATAAGATG 


AAAAAATTAA 


AAAGTCGAC 






2421 



(2) INFORMATION FOR SEQ 10 NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 336 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Met Gin Asn Ser Gly Leu Glu Asn Arg Asp Phe His Lys Leu Glu Asp 
15 10 15 

Tyr lie Val Asn Ala Asn Pro Thr Leu Lys Glu Glu Gly Glu Lys Leu 

20 25 30 

Arg Ala Tyr lie Lys Lys Glu Ser Lys Lys Arg Lys Lys Gly Ser Leu 
35 40 45 

Phe Lys Val His Lys Glu His Phe Gly Lys lie Thr Lys Asn Ser Thr 
50 55 60 

Pro Val Lys Val His Lys His Leu Ser Lys Val Ser Asp Ser Val Gly 
65 70 75 80 
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Phe Leu Trp Trp Asn Aen Asn Gly Lys Glu Gly Cys Ala Thr Cys Phe 

85 90 95 

Val Phe Lys Gly Leu Tyr He Leu Thr Cys Arg His Val He Thr Ser 

100 105 HO 

He Val Gly Glu Gly He Asp Pro Cys Gin Trp Ala Ser Leu He Ser 
115 120 125 

Gin Cys Val Lys Val Thr Phe Gly Tyr Glu Glu Phe Pro Leu Val Glu 
130 135 140 



Asp Lys Phe Phe Glu Val Lys Pro Trp Phe 
145 150 ' 

Leu Asp Tyr Ala Val Leu Glu Leu Arg Glu 

, 165 170 

Ala Gly Leu Tyr Asn Gly He Gly Pro Val 

180 185 

i 

Tyr He He Gly His Pro Asp Gly Glu Lys 
195 200 

Ala Val Val Pro Gin Ser His Arg Gly din 
210 • 215 

Gin Ala Arg Glu Lys Ala Asn Tyr Trp He 
225 230 

He Tyr Thr Gin Arg Ser Phe His Glu Val 

245 250 

Val Thr Tyr Asp Thr Thr Phe Phe Gly Gly 

260 265 

Phe Asp Ser Asn Gly Ser Leu Val Ala Met 
275 280 



Glu He Ser Asn Glu Ser 
155 160 

Asn Gly Gin Glu Val Pro 

175 

Pro Leu Gly Gly Leu He 

190 

Lys Ser Thr Asp Gly Cys 
205 

Lys Cys Gin Glu Asn Phe 
220 

Ser Thr Ser Phe Phe Pro 
235 ( 240 

Phe His Asn His Asp Val 

255 

Ser Ser Gly Ser Pro Val 

270 

His Ala Ala Gly He Thr 
285 



Cys Thr Asp Gin Asn Gly Val Phe Asn He He Glu Phe Gly Phe Thr 
290 295 300 

Met Glu Ser He Leu Ala Asn He Lys Gin Thr Lys Glu Trp Tyr Asp 

305 310 315 320 

Thr He Phe Val Asn Asp Arg Asp Ala Glu Met Leu Ser Thr Asp Ser 

325 330 335 



(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 836 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW097 

(ix) FEATURE: 

(A) NAME /KEY: misc_£ eature 

(B) LOCATION: complement (203.. 553) 

(D) OTHER INFORMATION: /label= SAC„23978 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

GTCGACTATT ACTATAGCCT TGTATATACT CCCCTCCGGC CCTGTTTGTC TCAGTCCCAT 60 

TCCCTCTTGT CTCTGAGAAT CATCCCCCAG CCCCAAGTTC 1 TCCCTTCTGT CTCCCCCCAC 120 

CCCCCCTGCC TCCCATTCTC CTGTCTCTGA ATGCCTTTGC CTGTATAAAG AGTTGGATTC 180 

TCCCCTGGTG TCTGTACTGT GTACACACAT CCCTTTGAGA AGCACAAGGA GATGACACGC 240 

GCATTGTAAC CTTCACACTG TCTCGGTGGC GACATAAAGG ^AGCTGTGAA TTACAAGCTC 300 

TGCCTCTTTC TGGCCTCTCC CTCGGCCCCC ATGACCGGGG CACTCTCTGC CCTCCCCACG 360 

GCCTTAACAT GTTCCTGCTC CACCTATCCT GGTGCCCTTT GTCTGGCTGA CTGTAGCCTC 420 

CCTGAGGGAA GGGATTGCTG GAGAGGTAGC TCCTATCCAG GAATGAAGGA CACTGGCTGG 480 

AAACTAAGCC AAGAATGTTA CAGACAGGGG GAGCTGGGGA TTGGTGACTG ACCCTTGCAG 540 

i i 

TTGGGCAGTG AGTACCCAAG GCCAGAAGCT GAAGCCCTCA TTCATTCAGT GGCCTTTCTT 600 

TCCTCAGGGT GCCCAGTGCC CTGTCCTCAC TTTGATGCCT AGGCCCACTG CTTGCATTTT 660 

CTGGTGAGCC AGGGCTTGGT CATGTGAGGA AAGAGGTAGG CCTAGAGCAG GAGTTGAAAG 720 

CCCTGCTGTT GTGTGTCCCT ACCATACAGG GCTGGCTAAA AGTCGAGGTA GACGCTTATC 780 

TAGGAAGCGC AAGGCCCTGG GTTCGGTCCC CAGCTCGGAA AAAAAAGAAC CAAAAA 836 
(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC23798 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 92: 
GGGGGATCCG CCCGATCCAA GACTGCCCAG GGTGGCTGCT CCATCTCACC CAGAGCAACC 60 
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GGAGCTGGAC CTGGGGGTCG GAGCACTTCG AAGGTGATGG GTTATGGTGC CGCCTGCTGG 120 

i 

ACATAGTTGT CCAGAGCTCT TCCTGAAAGG TTTGCTACAG GATTCGCCCT TCTCATCCAG 180 

TCTAAGAAAC TACTTAGTCA ACTCCTAGTG TCTAAGCAGA AGGCTTCAGC TATGGCTTCT 240 

CCCCTTGGGG AGGAAGAGGA AGAGGTACAA ATTCTTGTTT TGCTGCTCCC AGGGAAAAGA 300 

TCGGGCTAGA GCGGCCGC 318 
(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

i 

(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: SACJ23800 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

> i 

L 

GGGGGATCCG CCCGATCACA CGGCTGCCTT CTGGTATACA GCAGAGGGGA ATGCCAAGGT 60 

ATAAGTATCC TTCAATTCCT TCTTTGCCAC ATACCAGCAT TTCTTTGCCC TTTTGGCATT 120 

CTACCTCTTC TTTGACAGCA CTGAGACTGT TAAGTGCTGC ATACAAGCTA CTTGTCTCCC 180 

TATCTGTNNG CACATGTTTC ATGTCATGAA GATGATAAGG TCAGGCAAAG AGGCATGAAG 240 

ATTCCTGAAT AGTGCCCAGA TCGGGCTAGA GCGGCCGC 278 
(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 393 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: SACJ23801 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
GGGGGATCCG CCCGATCCAG TAGCGACAGC CCTGGCCCCT GACGAGGATG GAAGTGACGT 60 
TGTAGCNATT GTCTTCCTGT AGCTCATAGA TGGTGCTGTN NATGGTAAAG CGGCTTTGTC 120 
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TTTCTTTCTG GACCGCATTG CCTGCCAGGN NGACAACGAA CCACCTGCCC TGGAACCGTT 180 

CGGTCCAGAA GCCTGGCTGC AGGGGCACAC TGATCGATCC AGTAGCGACA GCCCTGGCCC 240 

CTGACGAGGA TGGAAGTGAC GTTGTAGCTA TTGTCTTCCT GTAGCTCATA GATGGTGCTG 300 

TACATGGTAA AGCGGCTTTG TCTTTCTTTC TGGACCGCAT TGCCTGCCAG GCCGACAACG 360 

AACCACCTGC CCTGGAACCG TTCGGTCCAG AAG 393 
(2) INFORMATION FOR SEQ ID NO: 95: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 531 base pairs 

(B) TYPE: nucleic acid ( 

(C) STRANDEDNESS : double . 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO , 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC.23833 , 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

GGGGGATCCG CCCGATCTGT ATCTCGTCCC TTGGCTGGCT GAGCCTCTGG AGGAGGTGTC 60 

CAAAGCCCCG GTGGATTTCT GTCTCAGGGG TCTCTGTGAG ATTGAACTTG AGACCTTCTA 120 

GAATCTCTTC CATGCTGTTG CCCTTTGCTC CCAGGGACAC GACGGCCAAG GCGGCTGAGA 180 

TGCTAAGTGG GGAGAAGACA ACATTTTTAT CTGGATTCCT CAAAGCCAGC TTCCTGTAGA 240 

GGCTGAAGGC AAAGTCAGTA TTGATGGAAG CCAGTGTGAG ACTGTCCAGT TGTGTCCCCT 300 

TGTCTTGGTC TTCATGGAAT AGAGTGTCCT CTCCCAGTAT GCAAATCTGG GAAGCAGAGG 360 

AGAGCAGAGC CCGATCGATC AAGATGACTA AGATGCTCAA AGGATTCGAC GCTGTGGGAA 420 

ATGCCACAGG TTTCCGGTTC GCCTACACCC CAGCTATGGA GAGCCTCTGT GGATATGTCA 480 

CAAGTCCAGA ACCGCAGCGA GGAGTTTCTC ATCGCGGGCC GTTTAAGGAC G 531 
(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 274 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
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(B) CLONE: SAC_23836 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 

GGGGGATCCG CCCGATCATG ATGAATCCCC TGGGACCAAG CTATAACACA CAGTCCTCTG 60 

AGAAAAAGGA TTGTGGTTTG CAGCAAATTC AGCCAATTTT GTCAAGCAAT TTGTGAGAGA 120 

CACTCCTTAC TTGGTTGTTT CCTTCTCAGT CTTACGGGGT GCCTTATCAC TCCGTGATGG 180 

TTCATGGCCA CACAGTCATA TTTCAGGGAG AAGTCCTTGT CGGTCACACC AGTTATCCTT 240 

AACAGTGAGG TTAAGCAAAT CATGCCATTG CTGG 274 
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 254 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double ( 

(D) TOPOLOGY: linear 

I ii) MOLECULE TYPE: cDNA 

! 

(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: SACJ23853 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

GGGGGATCCG CCCGATCGTG TTATTGAGGT CATCCACGGT GGAGATTAGC AGGTGACTCC 60 

GTTTTCCTGG AAATGAGTGG AGTTGGGAAG GGGACCCGTG GCCGTGCTCC AGCTATAGTT 120 

GGTGGGCGGT GGTTCGCTGC GAGCTTCACA GGTCAGGTTC ACGTTAGTAG GCCCAATGTA 180 

CCAGTTGCCT TCATAGCCAG AGATGGACAC TTCGGGTGGA TAAGGTAGGG AAAGGATCGG 240 

GCTAGAGCGG CCGC 254 
(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 255 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC_23906 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

GGGGGATCCG CCCACAGAAC CCAGAGGAAG GAGAGGCTGC TGGGGTGGAG GCCTAGGCGC 60 

TGGAGACATG TGGAGTTCTC TAGGGGTCTG CAGCAACCTC GGAAAGCTGG GAGATTCCCT 120 

CCTTGAGACT CCTACATATA GAAAACTGAT GCTTCTGTCT CATTCCATGC GGCTTTTCCT 180 

GCGGTATTCC TGTAGCGCTT TCTCTGCCAC TGTGTCCATA AACTTAGGGT TATCCTTGGA 240 

GACTTCTTCT GGTAC 255 
(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE SS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE : SAC_23907 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
GGGGGATCCG CCCACCCCAT CTAATCCTCT GAGGCTCAGG GTGACTCATT CTTCCTTGGA 60 

CCCAGGGGCG GGGCCGACAT TTGTTCCCCC AGTTGGGCCC TGCTGCCCCA GGCCAGTGGC 120 

ATTCCCAGTG TCTCCAGCCA CTAAAGCCAC ATTCCTCAGG TAGTTGGTAT TGAAGCAGTT 180 

GGCTTGTTTA TCTCCAGGAG ACAAGGTTCG AGCGGCCGC 2 IS 
(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 182 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC_23916 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
GGGGGATCCG CCCTCGAGCG GCCGCCCACA AGAGATTGAA CCTAGAGCTG TGTGCCTCCT 60 
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GGGCAAGCAC TCTCTACTGA GCTACATCCC TGTTAAAGTG CCTTTTTTGG GAGCTTTGTC 120 
TTCCAGCCTG CCAATCAACC CACTTTATGG GTGTGCCTAG ATTCCCCTTT CTCTGAGTAG 180 

GG 182 
(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 191 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE : SACJ23922 



{xi) SEQUENCE DESCRIPTION: SEQ ID, NO: 101: 

GGGGGATCCG CCCACTGTTC TCAAACAGTT GCCTAACAAT ACATTTAATT AAGAAATTAT 60 

CATAACTGTA TGCTTACACA TATGTGGATA GGCATCAATG AAGATTACAA CAGCTCAGCC 120 

ACAGTGCTGC AATGCTCTAC ACTACTACAA AACACCCACC ACGGGCCTGN NACCTGCCCT 180 

CGAGCGGCCG C 191 
(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 234 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE : SAC_24030 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

GGGGGATCCG CCCACCATAT CTATACGGCC CTGGAAGAGG AGATAGAGCG AAACAAGCAG 60 

AACCCAGTCT ATGCCCCGCT CTACTTNCCT GAGGAGCTGC ACCGAAGGCT GCCCTAGAGC 120 

AGGACATGGC CTTCTGGTAT GGGCCCCACT GGCAGGAGGC CATCCCTTAN NNACCAGCCA 180 

CACAGCACTA CGTAAAGCGT CTCCACGAGG TGGGAGGTGG GCTAGAGCGG CCGC 234 
(2) INFORMATION FOR SEQ ID NO: 103: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 172 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC_24078 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
GGGGGATCCG CCCGATCAAG ATGACTAAGA TGCTCAAAGG ATTCGACGCT GTGGGAAATG 60 
CCACAGGTTT CCGGTTCGCC TACACCCCAG CCATGAGAGC jCTCTGTGGAT ATGTCCACAA 120 
GTCCCAGAAC CCGCAGCGAG GAGTTTCTCA TCGCGGGCCG TTTAAGGAAC GG 172 
(2) INFORMATION FOR SEQ ID NO: 104: i 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 241 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC_24105 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 104: 

GGGGGATCCG CCCGATCTGT GCATGTGAGC CGAACAGAAG TCAGCAATAA CCATGTCTTG 60 

ATTTACCTGG ATAAGGTGTC AAATCAGACG GTGAACTTGT CCTTCACGGC TCAGCAAGAT 120 

ATTCCAATAA GGGACCTGAA GCCAGCCGTA GTGAAAGTCT ACGATTACTA TGAGAAAGAT 180 

GAGTTTGCAG TTGCAAAATA CAGCGCTCCC TGCAGCACAG ATTATGGAAA TGCCTGAGGA 240 

C 241 
(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 234 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii> IMMEDIATE SOURCE: 

(B) CLONE: SAC_24140 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
GGGGGATCCG CCCGATCCTT AAAGGTCCCA TAACCAAGTA CTCCATGCAG TCATCTCCTC 60 
TAACCTGTGG GTCCCCTCCA TTCATTGCAA GCTGCTGGAC, ATAGCCTGCT GGGTCCACCA 120 
CAAGTACAAC AGTGACAAGT CCAGCACCTA TGTGAAGAAT GGCACATCCT TCGACATCCA 180 
CTACGGCTCA GGTAGCCTCT CTGGGTACCT GAGCCAGGAC ACTGTGTTGG t TTCC 234 
(2) INFORMATION FOR SEQ ID NO: 106: 

l 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 217 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double ! 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 1 1 



(vii) IMMEDIATE SOURCE: 

(B> CLONE: SAC„24142 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 

GGGGGATCCG CCCGATCTAT CCGGGTATAT TCGGAACTGT GGAGTCTGAC CGCAGTGCAC 60 

TGCTACTCAG CTCACTCTGT CTCTTTCTCA CTGTGGGAGT AGCTATGGGA AAGGTCACAC 120 

TTCCTTGCTT TCCCCTCAAA CCAACAGGGC TTTCACTTCT CAGTCTCCCA ACGCACTGCA 180 

CACTGCTCCC AGCATCCCGC ACAGTCTCAC CACACAC 217 
(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 172 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC_24192 
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(xi) SEQUENCE DESCRIPTION: SEQIDNO:107: 
GGGGGATCCG CCCACATCTC AGCGGTGGGG ACTCAGACAT TCCTGTGTTC CCTCCTTGGC 60 
CTCCAGCCTC TCTGTAGGAA CCTCCAGCAG CCTGCCACCA GATTTCCCTT AGCTTCCACT 120 
GTCTCCATGA GCTTTAAAAT GTGGGCGGCC GCTCGAGGGC TAGAGCGGCC GC 172 
(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 212 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

i 

(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE ^OURCE: 

(B) CLONE: SAC.24296 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

GGGGGATCCG CCCACTTGGC TGACCGCCAG GCGAGTGACT GTGACTCCAG GTTCGCCCAC 60 

TACCGCAGCA GATACTGCCT CGCCTATGCC ACCTTCATAG TAGTGGTCCT CCACGGTGAG 120 

GATCCTGCCT TTGGTTGCTC TGGCACAGTC GAGAATGAGC TTTTTGTCCA GGGGCTTGAT 180 

GGTGAAGGGG TCCAGTGGGC TAGAGCGGCC GC 212 
(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC_24445 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
GGGGGATCCG CCCACAGCAT TCGGGTATGA GGCAAGGGGA GTGTTCATTC ACACACACAG 60 
TAGCTTCGGG TCTGTGAGGT CTCCTTGTTC CCGGGTAGGT TTCTAAAGAC GGAGAAAAAA 120 
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ATGATTCTGG TTATCAAGAC TACTGTGACC GTATTAGATC CCAGAATGGG CAAGCATCAG 180 
TGTGTGACCA TGCGAACAAA AGGAATTTTT 210 
(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 200 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



<vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC.24527 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

f 

GGGGGATCCG CCCACCATGG TCTCACTCAG AGACGGGTTT GGTCAGTTTT AAGATGAACT 60 

i 

CCTTCGGATG TAGTGTCCGT AAGTCCTTAT TATCGCTGTC CATCAATTCA CGATAGGATA 120 
GCTGGCTGCG GTGGCAAGTC CGCAGTGGTT GTTCCACCTG CCCACATTAT TCTCTCGTGG 180 
CCATGGTTCT TCCCTTAGAG ( < 200 

(2) INFORMATION FOR SEQ ID NO; 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC_24540 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

GGGGGATCCG CCCACAGTGG GAAGGATGGA AATAGCTGTC ATCTCTCCAA AACTGAGTTC 60 

CTTTCCTTCA TGAACACGGA GCTGGCCGCC TTCACGAAGA ACCAGAAGGA CCCCGGTGTC 120 

CTCGACCGCA TGATGAAGAA GCTGGACCTC AACAGTGATG GGCAGCTAGA TTTCCAAGAG 180 

TTTCTCAACC TTATTGGTGG CTTAGCTATA GCACCTGCCC ACTCTCGTGG CTGGCCCTCT 240 

GCTTATACTC CTTGGACTGC TCCAGGCTGT GTGTTTCCAC GCTTGACCTG CCCACAGAGC 300 

CAAAAGAGAC ACATTGGCTA CTTTAACCAC CTTAAAGCGG ACTCCAGGAA TATCACCCAC 360 
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GGCATGGCCT TTTGCGAACA AATCCCANCA ACCAAAAAAA 400 
(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 200 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY i linear 

(ii) MOLECULE TYPE: cDNA 

i 

(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC„24623 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

GGGGGATCCG CCCACAGTGC ATATTGGCGG CGCTCGCCTC ATTACGATTC GCCAGCTTGC 60 

TTCTCTTGTT CAATTGTTTC TTTTGAAGGC AGAGGATTTT TCTCTTGTGT TTCTGTCTTC 120 

TTCAACTTCG ACTTATfcGAA TTTCACCTGC CCACTAAACC CACAGCCCAG CCCTCCTTCC 180 

TTCCTCCTGT CCTGGAATTG 200 
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What is claimed is: 

1 . Purified and isolated nucleic acid encoding all or a unique fragment of a KIM. 

2. Nucleic acid of claim 1 that is RNA. 

3. Nucleic acid of claim 1 that is DNA. 
5 4. Nucleic acid of claim 3 that is cDNA, 

i 

5. Nucleic acid that is complementary to the nucleic acid of claim 1 . 

6. Nucleic acid that hybridizes to the nucleic acid of claim 1 or 5. 

7. Nucleic acid of claim 1 that is in operative association with an expression control 
element therefor. 

10 8. Antisense nucleic acid of claim 1 that is sufficient, when internalized within a cell, 

■I 

to disrupt expression of a cellular KIM gene. 

9. Purified and isolated nucleic acid encoding all or a unique fragment of a KIM, said 
nucleic acid having a sequence selected from the sequences listed in TABLE 1. 

10. Purified and isolated nucleic acid encoding all or a unique fragment of a KIM, said 
15 nucleic acid having a sequence selected from degenerate variants of the sequences 

listed in TABLE 1. 

1 1. Purified and isolated nucleic acid encoding all or a unique fragment of a KIM, said 
nucleic acid having a sequence selected from homologous variants of the sequences 
listed in TABLE 1. 

20 12. Purified and isolated nucleic acid encoding a KIM fusion polypeptide, the sequence 
of said nucleic acid comprising a KIM sequence selected from the sequences listed 
in TABLE 1 fused in frame to a sequence encoding a non- KIM polypeptide. 

13. Nucleic acid of claim 12 wherein the encoded non-KIM polypeptide is selected 
from: a secretable leader polypeptide, an immunoglobulin polypeptide, a binding 
25 pair partner, a toxin or toxoid, an enzyme, and a detectable polypeptide. 
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14. A vector comprising a KIM-encoding nucleic acid of any one of claims 1 to 13. 

15. A host cell stably transformed or transfected with a vector of claim 14. 

16. A method for producing a KIM polypeptide, comprising the steps of growing a host 
cell of claim IS under conditions sufficient for the production of polypeptides of 

5 vector origin; and, recovering an expressed KIM polypeptide. 

17. A purified and isolated polypeptide comprising all or a unique fragment of a KIM, 
said polypeptide having a sequence selected from the sequences encoded by nucleic 
acids listed in TABLE 1. 

18. A purified and isolated polypeptide comprising all or a unique fragment of a KIM, 
10 said polypeptide having a sequence selected from variants of the sequences encoded 

by nucleic acids listed in TABLE 1. 

19. A polypeptide of claim 18 wherein said variant is a splice variant, truncation 
variant, or substitution variant of an encoded TABLE 1 sequence. 

20. A KIM fusion polypeptide, the sequence of said polypeptide comprising all or a 
15 unique fragment of an encoded KIM sequence listed in TABLE 1, fused to a non- 

KIM polypeptide sequence. 

21 . A fusion polypeptide of claim 20 wherein the non-KIM polypeptide sequence is 
selected from: a secretable leader polypeptide sequence, an immunoglobulin 
polypeptide sequence, a binding pair partner sequence, a toxin or toxoid sequence, 

20 an enzyme sequence, and a detectable polypeptide sequence. 

22. A conjugate comprising a KIM polypeptide of claim 17, 18 or 19, linked to a 
detectable moiety, imageable moiety, radiolabeled moiety, or toxin moiety. 

23. A KIM fusion polypeptide, the sequence of said polypeptide comprising all or a 
unique fragment of an encoded KIM sequence selected from nucleic acid sequences 

25 listed in TABLE 2, fused to a non-KIM polypeptide sequence. 

24. A fusion polypeptide of claim 23 wherein the non-KIM polypeptide sequence is 
selected from: a secretable leader polypeptide sequence, an immunoglobulin 
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polypeptide sequence, a binding pair partner sequence, a toxin or toxoid sequence, 
an enzyme sequence, and a detectable polypeptide sequence. 

25. A conjugate comprising a KIM polypeptide of claim 23 or 24, linked to a 
detectable moiety, imageable moiety, radiolabeled moiety, or toxin moiety. 

5 26. An antibody that binds selectively to all or a unique fragment of a KIM polypeptide 
encoded by a nucleic acid sequence listed in TABLE 1 . 

27. A conjugate comprising an antibody of claim 26, linked to a detectable moiety, 

i 

I 

imageable moiety, radiolabeled moiety, or toxin moiety. 

i 

28. An antibody that binds selectively to all or a unique fragment of a KIM polypeptide 
10 encoded by a nucleic acid sequence listed in TABLE 2. 

29. A conjugate comprising an antibody of claim 29, Jinked to a detectable moiety, 
imageable moiety, radiolabeled moiety, or toxin moiety. 

30. Use in therapy of an antisense KIM nucleic acid of claim 8. 

31. Use in therapy of a KIM vector of claim 14. 

15 32. Use in therapy of a KIM polypeptide of claim 17, 18 or 19. 

33. Use in therapy of a KIM fusion protein of claim 20 or 21, or of a KIM conjugate of 
claim 22. 

34. Use in therapy of an antibody of claim 26, or of an antibody conjugate of claim 27. 

35. A pharmaceutical composition comprising a physiologically acceptable carrier 

20 having dispersed therein, to a therapeutically effective concentration: a KIM nucleic 

acid of claim 8; a KIM vector of claim 14; a KIM polypeptide of claim 17, 18 or 
19; a KIM fusion protein of claim 20 or 21; a KIM conjugate of claim 22; an anti- 
KIM antibody of claim 26; or, an anti-KIM antibody conjugate of claim 27. 

36. A method for detecting the presence or status of renal injury or of renal disease, 
25 comprising the step of measuring the concentration of a KIM polypeptide in a 
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10 



15 



20 



23 



37. 



38. 



39. 



40. 



41. 



42. 



sample selected from serum, urine or urine sediment of an individual afflicted with, 

i 

or at risk of developing, renal injury or an impairment of renal function. 

A method for detecting the presence or status of an abnormal response to tissue 
injury or other stimulus, comprising the step of measuring the concentration of a 
KIM polypeptide in a sample selected from serum, urine or urine sediment of an 
individual afflicted with, or at risk of developing, an autoimmune response or 
abnormal tissue growth arising from or affecting renal tissue. 

A method according to claim 36 or 37, wherein said measuring step involves 
contacting said sample with an anti-KIM antibody of claim 26 or 28, or with a 
conjugate of claim 27 or 29. 

i 

A kit for practice of the method of claim 36 or 37, comprising at least one reagent 
selected ftpm: the anti-KIM antibody of claim 26; the anti-KIM antibody of claim 
28; the conjugate of claim 27; and, the conjugate of claim 29. 

A method for detecting the presence or status of renal injury or of renal disease, 
comprising the step of measuring the level of a KIM nucleic acid in a sample 
selected from serum, urine or urine sediment of an individual afflicted with, or at 
risk of developing, renal injury or an impairment of renal function. 

A method for detecting the presence or status of an abnormal response to tissue 
injury or other stimulus, comprising the step of measuring the level of a KIM 
nucleic acid in a sample selected from serum, urine or urine sediment of an 
individual afflicted with, or at risk of developing, an autoimmune response or 
abnormal tissue growth arising from or affecting renal tissue. 

A method according to claim 40 or 41, wherein said measuring step involves 
contacting said sample with a KIM nucleic acid of claim 9, 10 or 1 1; or with a 
nucleic acid probe that hybridizes to KIM nucleic acid of claim 9, 10 or 11; or with 
a nucleic acid having a sequence selected from sequences listed in TABLE 2; or 
with a nucleic acid probe that hybridizes to a nucleic acid listed in TABLE 2. 
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43. A kit for practice of the method of claim 40 or 4 1 , comprising at least one reagent 
selected from: the KIM nucleic acid of claim 9, 10 or 1 1; or a nucleic acid probe 

i 

that hybridizes to KIM nucleic acid of claim 9, 10 or 1 1 ; or a nucleic acid having a 
sequence selected from sequences listed in TABLE 2; or a nucleic acid probe that 
5 hybridizes to a nucleic acid listed in TABLE 2. 

i 

44. A method of imaging cells or tissue expressing or producing a KIM, comprising the 
steps of contacting the cells or tissue with an imageable KIM-binding reagent; and, 

imaging an accumulation of the KIM-binding reagent. 

/ 1 

45. A method according to claim 44 wherein the cells or tissue are disposed in body 
10 tissue of an individual suspected of harboring an imageable locus of KIM 

expression and/or production. 

i i 

46. A method of locating a tissue mass abnormally producing or expressing a KIM in 
an individual suspected of affliction with an abnormal tissue response to injury or 
other stimulus, comprising the step of: administering an imageable KIM-binding 

15 reagent to the individual; and, imaging said tissue mass within the body of the 

individual. 

47. A method according to claim 44 wherein the tissue mass is a tumor arising from or 
affecting renal tissue of the individual. 

48. A method of treating an individual afflicted with, or at risk of developing, a disease 
20 or condition contributed to or associated with a dysfunction or dysregulation of a 

KIM gene or protein, comprising the step of administering to the individual a 
pharmaceutical composition of claim 35. 

49. A method of treating an individual afflicted with, or at risk of developing, a disease 
or condition beneficially affected by therapeutic administration of a KIM 

25 composition of the present invention, comprising the step of administering to the 

individual a pharmaceutical composition of claim 35. 
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50. A method according to claim 48 or 49 wherein the disease is a renal disease; or, the 
condition is an impairment of renal function. 

5 1 . A method of inhibiting the growth of KIM-expressing tumor cells, comprising the 
step of contacting the tumor cells with a pharmaceutical composition comprising 
involving the step of contacting the ceils with: an antisense KIM nucleic acid of 
claim 8; a KIM fusion protein of claim 20, 21, 23 or 24; a KIM conjugate of claim 
22 or 25; an antibody of claim 26 or 28; or an antibody conjugate of claim 27 or 29, 

52. A method according to claim 36, 37, 40, 41, 45< 46, 47, 48, 49, or 50 wherein the 
individual is a human. 
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